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HUMAN PHOSPHATIDIC ACID PHOSPHATASE 

Field of the Invention 

This invention relates to human phosphatidic acid 
phosphatase. More particularly, this invention relates 
to three variants of human phosphatidic acid phosphatase 
namely PAP-a(l and 2), PAP-/3 and PAP-y and uses thereof. 
The invention encompasses biotechnology inventions, 
including biotechnology products and processes. 

Background of the Invention 

Phosphatidic acid phosphatase (PAP) (also referred 
to in the art as phosphatidate phosphohydrolase) is known 
to be an important enzyme for glycerolipid biosynthesis. 
In particular, PAP catalyzes the conversion of 
phosphatidic acid (PA) (also referred to in the art as 
phosphatidate) into diacylglycerol (DAG) . DAG is an 
important branch point intermediate just downstream of PA 
in the pathways for biosynthesis of glycerophosphate - 
based phospholipids (Kent, Anal. Rev.Biochem. 64: 315- 
343, 1995) . 

In eukaryotic cells, PA, the precursor molecule for 
all glycerophospholipids, is converted either to CDP- 
diacylglycerol ( CDP-DAG) by CDP-DAG synthase (CDS) or to 
DAG by phosphatidic acid phosphatase (PAP) . In mammalian 
cells, CDP-DAG is the precursor to phosphatidylinositol 
(PI), phosphatidylglycerol (PG) , and cardiolipin (CL) ; 
whereas diacylglycerol is the precursor to 
triacylglycerol (TG) , phosphatidylethanolamine (PE) , and 
phosphatidylcholine (PC) in all eukaryotic cells. 
Therefore, the partitioning of phosphatidic acid between 
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CDP-diacylglycerol and diacylglycerol is an important 
regulatory point in eukaryotic phospholipid metabolism 
(Shen et al . , J. Biol. Chem. 271: 789-795, 1996). 

In addition to being an important enzyme for 
5 glycerolipid biosynthesis, PAP is also an important 

enzyme for signal transduction. PAP catalyses the 
dephosphorylation of PA to DAG. DAG is a well -studied 
lipid second messenger which is essential for the 
activation of protein kinase C (Kent, Anal. Rev.Biochem. 
10 64: 315-343, 1995); whereas PA itself is also a lipid 

messenger implicated in various signaling pathways such 
as NADPH oxidase activation and calcium mobilization 
(English, Cell Signal. 8: 341-347, 1996). The regulation 
of PAP activity can therefore affect the balance of 
15 divergent signaling processes that the cell receives in 

terms of PA and DAG (Brindley et al . , Chem. Phys . Lipids 
80: 45-57, 1996) . 

Various forms of PAP have been isolated in porcine 
(Kai et al., J. Biol. Chem. 271: 18931-18938, 1996) and 
20 rat species (Brindley et al., Chem. Phys. Lipids 80: 45- 

57, 1996) . Furthermore, the putative amino acid sequence 
of murine PAP has been identified. Kai et al . , supra. 
Prior to the instant invention, however, human PAP had 
not been identified or isolated. 
25 Genes coding for PAP have been identified in E. coli 

(Dillon et al, J. Biol. Chem. 260: 12078-12083, 1985) and 
in mouse (Kai et al . , J. Biol. Chem. 271: 18931-18938, 
1996) . Furthermore, the following GenBank human cDNA 
clones are available: accession nos. H17855, N75714, and 
30 W70040. No uses were known, however, for these 

polynucleotide sequences . 

Accordingly, there is a need for the identification 
and isolation of human PAP and for methods of using human 
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PAP # for example, for the dephosphorylation of a 
substrate . 

Summary of the Invention 

It is therefore an object of the present invention 
to provide a polynucleotide sequences encoding three or 
more variants of human PAP, namely PAP-a(l and 2), PAP-/3 
and PAP-y. 

It is a further object to provide the isolated 
protein of these three variants. 

It is yet a further object to provide a 
biotechnology method for preparing these variants via 
recombinant methods . 

It is a further object to provide a biotechnology 
method of using these variants or human PA in general to 
synthesize DAG. 

In accomplishing these and other objects there is 
provided an isolated polynucleotide encoding human 
phosphatidic acid phosphatase wherein the polynucleotide 
encodes a protein comprising a polypeptide sequence 
selected from the group consisting of (i) the sequence at 
amino acid number 1 to amino acid number 284 (SEQ ID 
N0:2) in Figure 1, (ii) the sequence at amino acid number 
1 to amino acid number 285 (SEQ ID NO: 4) in Figure 2, and 
(iii) the sequence at amino acid number 1 to amino acid 
number 2 76 (SEQ ID NO: 8) in Figure 4. 

There is further provided an isolated human 
phosphatidic acid phosphatase protein, wherein the 
protein comprises a polypeptide sequence selected from 
0 the group consisting of (i) the sequence at amino acid 

number 1 to amino acid number 2 84 (SEQ ID N0:2) in Figure 
1, (ii) the sequence at amino acid number 1 to amino acid 
number 285 (SEQ ID NO:4) in Figure 2, and (iii) the 
sequence at amino acid number 1 to amino acid number 27 6 
5 (SEQ ID NO: 8) in Figure 4. 
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There if further provided a method of preparing a 
human phosphatidic acid phosphatase-/? protein comprising 
the steps of (i) transforming a host cell with an 
expression vector comprising a polynucleotide encoding 
5 human phosphatidic acid phosphatase, (ii) culturing the 

transformed host cells which express the protein and 
(iii) isolating the protein. 

There if further provided a method of 
dephosphorylating a substrate comprising contacting the 
10 substrate with an effective amount of isolated human 

phosphatidic acid phosphatase protein such that the 
protein catalyzes the dephosphorylation of the substrate. 
It is further provided that the substrate of this method 
is selected from the group consisting of phosphatidic 
15 acid, lysophosphatidic acid, ceramide 1-phosphate, and 

sphingosine 1-phosphate. It is further provided that 
this method occurs in vitro, and comprises a step of 
isolating the dephosphoryled substrate. Additionally, 
the method can occur in vivo, and is effected by the 
20 administration of human phosphatidic acid phosphatase to 

a mammal in need thereof . 

Brief Description of the Drawings 

Figure 1 shows the DNA sequence of the cDNA insert 
25 of the human PAP-orl isolated herein and the corresponding 

amino acid sequence (SEQ ID NOS : 1 and 2) . 

Figure 2 shows the DNA sequence of the cDNA insert 
of the human PAP-o?2 isolated herein and the corresponding 
amino acid sequence (SEQ ID NOS: 3 and 4) . 
30 Figure 3 shows the DNA sequence of the cDNA insert 

of the human PAP-jS isolated herein and the corresponding 
amino acid sequence (SEQ ID NOS: 5 and 6) . 

Figure 4 shows the DNA sequence of the cDNA insert 
of the human PAP-7 isolated herein and the corresponding 
35 amino acid sequence (SEQ ID NOS : 7 and 8) . 
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Figure 5 shows amino acid sequences alignment of the 
murine PAP coding sequence and the coding sequences for 
human PAP-o?(l and 2) , PAP-jS and PAP-7 (SEQ ID NOS:9-13) . 

Figure 6 shows the effect of IL-ljS on PAP-/3 
expression in human endothelial ECV3 04 cells using 
Northern blot analysis. 

Figure 7 depicts a thin layer chromatography 
analysis demonstrating the increase in PA 
dephosphorylation in cells transfected with either the 
PAP-al or PAP - oc2 cDNA expression plasmids . 

Figure 8 shows the differential expression of PAP- 
a mRNA in various tumor versus normal tissues. 

Figure 9 is a schematic representation of 
glycerophospholipid biosynthesis involving the conversion 
of PA to either DAG or CDP-DAG. The synthesis of PA to 
DAG involves the PAP enzyme, while the synthesis of . PA to 
CPD-DAG involves the CDS enzyme. 

Detailed Description of Preferred Embodiments 

0 This invention relates to isolated human 

phosphatidic acid phosphatase. More particularly, this 
invention relates to three variants of human phosphatidic 
acid phosphatase namely PAP-a(l and 2) , PAP-/3 and PAP-7. 
Examples of the uses for human PAP include the 

5 following. PAP is an important tool for enzymatic 

catalysis of several biologically significant proteins.. 
As discussed above, PAP catalyzes the dephosphorylation 
of PA to DAG. DAG, in turn, is essential for the 
activation of protein kinase C (Kent, Anal. Rev.Biochem. 

0 64: 315-343, 1995) . 

Moreover, PAP catalyzes the dephosphorylation of 
lysophosphatidic acid (LPA) , ceramide 1 -phosphate (C-l- 
P) , and sphingosine 1-phosphate (S-l-P) (Brindley et al . , 
Chem. Phys. Lipids 80: 45-57, 1996) . In the case of LPA, 

i5 S-l-P, and C-l-P, the products of the PAP reaction are 

monoacylglycerol , sphingosine, and ceramide, 
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respectively. PAP can control the balance of a wide 
spectrum of lipid mediators of cell activation and signal 
transduction by modulating the phosphorylated state of 
these lipids. 

5 Additionally, the human PAP of the present invention 

are likely to define a new family of tumor suppressor 
genes that can be used as candidate genes for gene 
therapy for the treatment of certain tumors. The 
relationship of PAP and tumor suppression is evidenced 
10 in findings that PAP activity is lower in fibroblast cell 

lines transformed with either the ras or fps oncogene 
than in the parental rati cell line (Brindley et al . , 
Chem. Phys. Lipids 80: 45-57, 1996). Decrease in PAP 
activity in transformed cells correlates with a 
15 concomitant increase in PA concentration. Moreover, 

elevated PAP activity and lower level of PA has been 
observed in contact -inhibited fibroblasts relative to 
proliferating and transformed fibroblasts (Brindley et 
al., Chem. Phys. Lipids 80: 45-57, 1996). Therefore, PAP 
20 plays a role in decreasing cell division and as such can 

provide a useful tool in treating cancer. 

Additionally, PA, the substrate for the enzyme PAP, 
has been implicated in cytokine induced inflammatory 
responses (Bursten et al . , Circ. Shock 44: 14-29, 1994; 
25 Abraham et al . , J . Exp. Med. 181: 569-575, 1995; Rice et 

al., Proc. Natl. Acad. Sci. USA 91: 3857-3861 1994; Leung 
et al., Proc. Natl. Acad. Sci. USA 92: 4813-4817, 1995) 
and the modulation of numerous protein kinases involved 
in signal transduction (English et al . , Chem. Phys. 
30 Lipids 80: 117-132, 1996) . Because of the possibility 

that activation of human PAP expression can counter- 
balance the inflammatory response from cytokine 
stimulation through degradation of excess amount of PA in 
cells, the genes encoding human PAP can be used in gene 
35 therapy for the treatment of inflammatory diseases. 
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Human PAP described herein can also be used in gene 
therapy for the treatment of obesity associated with 
diabetes. PAP activity is decreased in the livers and 
hearts of the grossly obese and insulin resistant JCR:LA 
corpulent rat compared to the control lean phenotype 
(Brindley et al . , Chem. Phys . Lipids 80: 45-57, 1996). 
Human PAP described herein therefore can provide an 
important tool for the treatment of obesity associated 
-with diabetes. 



1. Human PAP 

As used herein, "phosphatidic acid phosphatase" or 
"PAP" refers to a protein capable of catalyzing the 
dephosphorylation of PA to DAG. PAP also includes 
15 proteins capable of catalyzing the dephosphorylation of 

lysophosphatidic acid (LPA) , ceramide 1-phosphate (C-l- 
P) , and sphingosine 1-phosphate (S-l-P) . 

As used herein, "isolated" PAP denotes a degree of 
separation of the protein from other materials endogenous 
20 to the host organism. As used herein, "purified" denotes 

a higher degree of separation than isolated. A purified 
protein is sufficiently free of other materials 
endogenous to the host organism such that any remaining 
materials do not adversely affect the biological 
25 properties of the protein, for example, a purified 

protein is one sufficiently pure to be used in a 
pharmaceutical context. 

As used herein, "human" PAP refers to PAP naturally 
occurring (or "native") in the human species, including 
30 natural variations due to allelic differences. The term 

"human PAP," however, is not limited to native human 
proteins, but also includes amino acid sequence variants 
of native human PAP that demonstrate PAP activity, as 
defined above.. 

35 Variants often exhibit the same qualitative 

biological activity as the naturally-occurring analogue, 
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although variants also are selected in order to modify 
the characteristics of PAP protein. In a preferred 
embodiment, therefore, human PAP includes the amino acid 
sequences of Figures 1-4 (SEQ ID NOS : 2 , 4, 6 and 8), 
5 being PAP-al, PAP-a2, PAP-/3 and PAP-7, respectively and 

variants thereof . 

Amino acid sequence variants of the protein can be 
substitutional, insertional or deletion variants. 
Deletion variants lack one or more residues of the native 
10 protein which are not essential for biological activity. 

An example of a common deletion variant is a protein 
lacking transmembrane sequences. Another example is a 
protein lacking secretory signal sequences or signal 
sequences directing the protein to bind to a particular 
15 part of a cell. 

Substitutional variants typically contain the 
exchange of one amino acid for another at one or more 
sites within the protein, and are designed to modulate 
one or more properties of the protein such as stability 
20 against proteolytic cleavage. Substitutions preferably 

are conservative, that is, one amino acid is replaced 
with one of similar shape and charge. Conservative 
substitutions are well known in the art and include, for 
example, the changes of: alanine to serine; arginine to 
25 lysine; asparigine to glutamine or histidine; aspartate 

to glutamate; cysteine to serine; glutamine to 
asparigine; glutamate to aspartate; glycine to proline; 
histidine to asparigine or glutamine; isoleucine to 
leucine or valine; leucine to valine or isoleucine; 
3 0 lysine to arginine, glutamine, or glutamate; methionine 

to leucine or isoleucine; phenylalanine to tyrosine, 
leucine or methionine; serine to threonine; threonine to 
serine; tryptophan to tyrosine; tyrosine to tryptophan 
or phenylalanine; and valine to isoleucine or leucine. 
3 5 Of course, other amino acid substitutions can be 

undertaken. 
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Insertional variants contain fusion proteins such 
as those used to allow rapid purification of the protein 
and also can include hybrid proteins containing sequences 
from other proteins and polypeptides which are protein 

5 homologues . 

Variants of human PAP also include fragments, 
analogs, derivatives, muteins and mimetics of the natural 
PAP protein that retain the ability to cause the 
beneficial results described above. Fragments of the 

0 human PAP protein refer to portions of the amino acid 

sequence of the PAP polypeptide that also retain this 
ability. 

Variants can be generated directly from the human 
PAP protein itself by chemical modification by 

5 proteolytic enzyme digestion, or by combinations thereof. 

Additionally, methods of synthesizing polypeptides 
directly from amino acid residues also exist. 

Non-peptide compounds that mimic the binding and 
function of the human PAP protein {"mimetics") can be 

0 produced by the approach outlined in Saragovi et al . , 

Science 253: 792-95 (1991). Mimetics are peptide- 
containing molecules which mimic elements of protein 
secondary structure. See, for example, Johnson et 
al. , "Peptide Turn Mimetics" in BIOTECHNOLOGY AND 

5 PHARMACY, Pezzuto et al . , Eds., (Chapman and Hall, New 

York, 1993) . 

The underlying rationale behind the use of peptide 
mimetics is that the peptide backbone of proteins exists 
chiefly to orient amino acid side chains in such a way 
0 as to facilitate molecular interactions For the purposes 

of the present invention, appropriate mimetics can be 
considered to be the equivalent of the human PAP protein 
itself . 

More typically, at least in the case, of gene 
5 therapy, variants are created by recombinant techniques 

employing genomic or cDNA cloning methods. Site-specific 
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and region-directed mutagenesis techniques can be 
employed. See CURRENT PROTOCOLS IN MOLECULAR BIOLOGY 
vol. 1, ch. 8 (Ausubel et al . eds . , J. Wiley & Sons 198 9 
& Supp. 1990-93); PROTEIN ENGINEERING (Oxender Sc Fox 
5 eds., A. Liss, Inc. 1987). In addition, linker- scanning 

and PCR-mediated techniques can be employed for 
mutagenesis. See PCR TECHNOLOGY (Erlich ed. , Stockton 
Press 1989) ; CURRENT PROTOCOLS IN MOLECULAR BIOLOGY, 
vols. 1 Sc 2, supra. Protein sequencing, structure and 
10 modeling approaches for use with any of the above 

techniques are disclosed in PROTEIN ENGINEERING, loc. 
cit. and CURRENT PROTOCOLS IN MOLECULAR BIOLOGY, vols. 1 
& 2 , supra. . 

2 . Polynucleotides Encoding Hu man PAP 

The present invention further includes isolated 
polynucleotides encoding human phosphatidic acid 
phosphatase. As used herein, an "isolated" 

polynucleotide denotes a degree of separation of the 
20 polynucleotide from its naturally occurring environment, 

e.g., from its native intact genome. In a preferred 
embodiment, the isolated polynucleotides correspond to 
those shown in Figure 1 at nucleotide number 342 to 
nucleotide number 1193 of SEQ ID NO:l; Figure 2 at 
25 nucleotide number 342 to nucleotide number 1196 of SEQ ID 

NO: 3; Figure 3 at nucleotide number 294 to nucleotide 
number 1226 of SEQ ID NO: 5; and Figure 4 at nucleotide 
number 4 to nucleotide number 833 of SEQ ID NO:7. 

The invention furthermore relates to a 
3 0 polynucleotide whose sequence is degenerate with respect 

to the sequences mentioned above in accordance with the 
nature of the genetic code. Degeneracy is often referred 
to as codon/anticodon wobble, and is discussed in Watson 
et al., MOLECULAR BIOLOGY OF THE GENE (4th ed. 1987) at 
35 437-43. 
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.The present invention further includes bases # 
nucleosides, nucleotides, oligonucleotides derived from 
the isolated polynucleotides of the present invention. 
The term "derived" when used in the context of the 
5 present invention connotes a degree of similarity that 

is sufficient to indicate the original polynucleotide 
from which hybrid forms, or portions thereof, were 
obtained. Also within the scope of the invention are so- 
called "polyamide" or "peptide" nucleic acids ("PNAs" ) 
0 derived from the polynucleotides of the present 

invention. PNAs are constructed by replacing the 

(deoxy) ribose phosphate backbone of a subject 
polynucleotide with an achiral polyamide backbone or the 
like. See Nielsen et al . , Science 254: 1497-54 (1991). 
.5 The above polynucleotides and derivations thereof 

can be used as important tools in recombinant DNA and 
other protocols involving nucleic acid hybridization 
techniques. More specifically, oligonucleotides and 
nucleic acids derived from the isolated polynucleotides 
2 0 shown in Figures 1-4 (SEQ ID NOS : 1 , 3, 5, and 7) can be 

used as hybridization probes, capable of recognizing and 
specifically binding to complementary nucleic acid 
sequences, providing thereby a means of detecting, 
identifying, locating and measuring complementary nucleic 
25 acid sequences in a biological sample. 

Biological samples include, among a great many 
others, blood or blood serum, lymph, ascites fluid, 
urine, microorganism or tissue culture medium, cell 
extracts, or the like, derived from a biological source, 
30 or a solution containing chemically synthesized protein, 

- or an extract or solution prepared from such fluid from 
a biological source . 

An oligonucleotide containing a modified nucleotide 
of the invention can be used as a primer to initiate 
35 nucleic acid synthesis at locations in a DNA or RNA 

molecule comprising the sequence complementary to the 



SUBSTITUTE SHEET (RULE 26) 



WO 98/46730 PCT/US98/07928 

12 



oligonucleotide sequence. The synthesized nucleic acid 
strand would have incorporated, at its 5' terminus, the 
oligonucleotide primer bearing the invention and would, 
therefore, be detectable by exploitation of the 
5 characteristics of the detectable label. Two such 

primers, specific for different nucleotide sequences on 
complementary strands of dsDNA, can be used in the 
polymerase chain reaction (PCR) to synthesize and amplify 
the amount of a nucleotide sequence. The detectable 
10 label present on the primers will facilitate the 

identification of desired PCR products. PCR, combined 
with techniques for preparing complementary DNA (cDNA) 
can be used to amplify various RNAs, with oligonucleotide 
primers again serving both to provide points for 
15 initiation of synthesis in the cDNA duplex flanking the 

desired sequence and to identify the desired product. 
Primers labeled with the invention may also be utilized 
for enzymatic nucleic acid sequencing by the dideoxy 
chain- termination technique. 
20 The invention can be applied to measure or 

quantitate the amount of DNA present in a sample. For 
instance, the concentration of nucleic acid can be 
measured by comparing detectable labels incorporated into 
the unknown nucleic acid with the concentration of 
25 detectable labels incorporated into known amounts of 

nucleic acid. 

Such a comparative assessment can be done using 
biotin where the respective concentrations are determined 
by an enzyme-linked assay utilizing the streptavidin- 
30 alkaline phosphatase conjugate and a substrate yielding 

a soluble chromogenic or chemiluminescent signal. 

3 . Recombinant Production of Human PAP 

In a further embodiment human PAP is expressed via 
35 recombinant methods known to those of skill in the art. 

The polynucleotides of the present invention can be 
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expressed in any number of different recombinant DNA 
expression systems to generate large amounts of protein, 
which can then be purified and used for the various 
applications of human PAP described above. Included 
5 within the present invention are proteins having native 

glycosylation sequences, and deglycosylated or 
^glycosylated proteins prepared by the methods described 
below. 

Recombinant technology for producing desired 
10 proteins is known by ordinarily skilled artisans and 

includes providing a coding sequence for a desired 
protein, and operably i inking the coding sequence to 
polynucleotide sequences capable of effecting its 
expression. 

15 With regard to one aspect of the invention, it often 

is desirable to produce human PAP as a fusion protein, 
freed from upstream, downstream or intermediate 
sequences, or as a protein linked to leader sequences, 
effecting secretion of human PAP into cell culture 
20 medium. 

A typical expression system will also contain 
control sequences necessary for transcription and 
translation of a message. Known control sequences, 
include constitutive or inducible promoter systems, 
25 translational initiation signals (in eucaryotic. 

expression) , polyadenylation translation termination 
sites, and transcription terminating sequences. 
Expression vectors containing controls which permit 
operably linking of desired coding sequences to required 
30 control systems are known by the skilled artisan. Such 

vectors can be found which are operable in a variety of 
hosts. 

Human PAP of the present invention may be produced 
in procaryotic cells using appropriate controls, such as 
35 trp or lac promoters, or in eucaryotic host cells, 

capable of effecting post- translational processing that 
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permits proteins to assume desired three-dimensional 
conformation. Eucaryotic control systems and expression 
vectors are known; including leu and glycolytic promoters 
useful in yeast, the viral SV40 and adenovirus and CMV 
5 promoters in mammalian cells, and the baculovirus system 

which is operable in insect cells. Plant vectors with 
suitable promoters, such as the nos promoter are also 
available . 

Standard laboratory manuals (e.g., Sambrook et al . , 
10 MOLECULAR CLONING: A LABORATORY MANUAL, Second Edition, 

(Cold Spring Harbor Laboratory Press, Cold Spring Harbor, 
NY 1989) present standard techniques and methodologies 
for expressing polynucleotides encoding a desired 
protein, culturing appropriate cells, providing suitable 
15 expression conditions, and recovering a resulting protein 

from culture. 

In preparing the inventive human PAP, a suitable 
polynucleotide encoding human PAP, constructed utilizing 
any of the foregoing techniques is operable linked to an 
20 expression vector which is then transformed into a 

compatible host. Host cells are cultured using 

conditions appropriate for growth. Expression of the 
desired human PAP is preferably induced after some 
predetermined growth level has occurred. Human PAP 
25 production is monitored and the desired protein isolated 

from culture either from a supernatant, or by first 
lysing host cells with an appropriate agent, or by other 
methods known to the skilled artisan. 

In another preferred embodiment, a polynucleotide 
3 0 encoding human PAP is ligated into a mammalian expression 

vector. A preferred mammalian expression vector is the 
plasmid "pCE2 . " The plasmid pCE2 is derived from pREP7b 
(Leung, et al . , Proc . Natl. Acad. Sci. USA, 92: 4813- 
4817, 1995) with the RSV promoter region replaced by the 
35 CMV enhancer and the elongation factor-lar (EF-la) 

promoter and intron. The CMV enhancer of the pCE2 vector 
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is constructed from a 3 80 bp Xba I-Sph I fragment 
produced by-PCR from pCEP4 (Invitrogen, San Diego, CA) 
using the primers 5 1 -GGCTCTAGAT ATTAATAGTA ATCAATTAC-3 1 
(SEQ ID NO: 14) and 5 ' -CCTCACGCAT GCACCATGGT AATAGC-3 1 

5 (SEQ ID NO: 15) . The EF-la promoter and intron (Uetsuki, 

et al., J. Biol. Chem. , 264: 5791-5798, 1989) are 
constructed from a 1200 bp Sph I-Asp718 I fragment 
produced by PCR from human genomic DNA using the primers 
5 1 - GGTGCATGCG TGAGGCTCCG GTGC-3 ' (SEQ ID NO: 16) and 5'- 

0 GTAGTTTTCA CGGTACCTGA AATGGAAG-3 1 (SEQ ID NO: 17) . These 

2 fragments are ligated into a Xba I/Asp718 I digested 
vector derived from pREP7b to generate pCE2 . 

In another preferred embodiment of the present 
invention, pCE2 containing a polynucleotide expressing 

5 human PAP is used to transform a host cell which the*} 

expresses the protein. Preferred host cells include the 
human embryonic kidney cell line 293-EBNA (Invitrogen, 
San Diego, CA) , endothelial ECV304 cells, and epithelial 
A549 cells. 

0. 

4. Dephosphorvlation of Substrate 

In another embodiment, the present invention 
includes a method of dephosphorylating a substrate by 
contacting the substrate with an effective amount of 

5 isolated human PAP. An "effective amount" of human PAP 

is an amount which will dephosphorylate a detectable 
amount of substrate. Such an amount can be determined 
empirically based on variables well known to those of 
skill in the art, such as reaction time and temperature. 

0 In one embodiment, the substrate includes 

phosphatidic acid, lysophosphatidic acid, ceramide 1- 
phosphate, and sphingosine 1 -phosphate. In another 
embodiment, the isolated human PAP includes PAP-or(l and 
2), PAP-/3 and PAP-7 and variants thereof. 

5 In a further embodiment, the dephosphorylation of 

substrate occurs in vitro, by contacting a substrate with 
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recombinant ly produced human PAP expressed by the methods 
described above. The dephosphorylated substrate is then 
isolated by standard isolation and purification methods, 
including for example, thin layer chromatography or high 
pressure liquid chromatography. 

In another embodiment, the dephosphorylation of 
substrate occurs in vivo via the administration of human 
PAP to a mammal, preferably a human. "Administration" 
means delivery of human PAP protein to a mammal by 
methods known to those of skill in the art including, but 
not limited to: orally, for example in the form of 
pills, tablets, lacquer tablets, coated tablets, 
granules, hard gelatin capsules, soft gelatin capsules, 
solutions, syrups, emulsions, suspensions or aerosol 
mixtures; rectally, for example in the form of 
suppositories; parenterally , for example in the form of 
injection solutions or infusion solutions, microcapsules 
or rods; percutaneously , for example in the form of 
ointments or tinctures; transdermally ; intravascularly , 
0 intracavitarily ; intramuscularly; subcutaneous ly ; and 

nasally, for example in the form of nasal sprays or 
inhalants . 

The administration of human PAP protein includes the 
administration of the protein combined in a mixture with 

5 a pharmaceutical^ acceptable carrier vehicle. Suitable 

vehicles and their formulation, inclusive of other human 
proteins, e.g. human serum albumin, are described for 
example in Remington's Pharmaceutical Sciences by E.W. 
Martin, which is hereby incorporated by reference. Such 

0 compositions will contain an effective amount of protein 

hereof together with a suitable amount of vehicle in 
order to prepare pharmaceutically acceptable compositions 
suitable for effective administration to the host. 

Such compositions should be stable for appropriate 

5 periods of time, preferably are acceptable for 

administration to humans and preferably are readily 
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manuf acturable . Although pharmaceutical solution 

formulations are provided in liquid form appropriate for 
immediate use, formulations may also be provided in 
frozen or in lyophilized form. In the former case, the 
5 composition must be thawed prior to use. The latter form 

is often used to enhance the stability of the medicinal 
agent contained in the composition under a wide variety 
of storage conditions. Such lyophilized preparations are. 
reconstituted prior to use by the addition of suitable 
10 pharmaceutically acceptable diluents, such as sterile 

water or sterile physiological saline solution. 

Additionally, administration is meant to include 
delivery of human PAP protein to a mammal by means of 
gene therapy techniques, i.e., by the delivery of 
15 polynucleotides encoding human PAP to PAP-def icient 

cells, whereby human PAP is then expressed in the cell. 
Gene therapy techniques are known to those of skill in 
the art. For example, listing of present-day vectors 
suitable for use in gene therapy of the present invention 
20 is set forth in Hodgson, Bio /Technology 13: 222 (1995) . 

See also . Culver et al . , Science, 256:1550-62 (1992). 

Additionally, liposome-mediated gene transfer is 
another suitable method for the introduction of a 
recombinant vector containing a polynucleotide,, encoding 
25 human PAP into a PAP-def icient cell. See Caplen et al . , 

Nature Med. 1:39-46 (1995) and Zhu et al..., Science 
* 261:209-211 (1993) . 

• Additionally, viral vector-mediated gene transfer is 
also a suitable method for the introduction of a 
3 0 recombinant vector containing the gene encoding human PAP 

into a PAP-def icient cell. Examples of appropriate viral 
vectors are adenovirus vectors. Detailed discussions of 
the use of adenoviral vectors for gene therapy can be 
found in Berkner, Biotechniques 6:616-629 (1988), 
35 Trapnell, Advanced Drug Delivery Rev. 12:185-199 (1993). 
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The following examples merely illustrate the 
invention and, as such, are not to be considered as 
limiting the invention set forth in the claims. 

5 Example 1 

Cloning and Expression of Human PAP-a, PAP-ff and PAP-7 

Homology search of the Genbank database (Boguski, 
et al-, Science '265:1993-1994, 1994) of expressed 
10 sequence tag (dbEST) using the murine PAP protein 

sequence (Kai et al . , J. Biol. Chem. 271: 18931-18938, 
1996) as probe identified several short stretches of 
human cDNA sequences with homology to the murine PAP 
protein sequence. These cDNA sequences of interest were 
15 derived from single -run partial sequencing of random 

human cDNA cloning projects carried out mainly by 
l.M.A.G.E. Consortium [LLNL] cDNA clones program. Based 
on the partial DNA sequences available in the GenBank 
database, the human cDNA clones that are homologous to 
20 the murine PAP protein sequence can be grouped into three 

classes, suggesting the presence of at least three 
different human PAP variants, designated as PAP-a, PAP- 
j3, and PAP-y here. For instance, a potential human PAP-a 
clone (GenBank #H17855) identified contains sequence 
25 homologous to aa 272-283 and the 3 ' -untranslated region 

of murine PAP; a potential human PAP-/3 clone (GenBank 
#W70040) identified contains sequence similarities 
corresponding to aa 175-251 of murine PAP; and a 
potential human PAP -7 clone (GenBank #N75714) identified 
3 0 contains sequences similarities corresponding to aa 18- 

142 of murine PAP. These cDNA clones were purchased 
(Genome Systems, St. Louis, MO) for further analysis. 
DNA sequence determination of the entire cDNA inserts of 
these clones showed clone H17855 contained sequences that 
3 5 are homologous to the N- and C- terminal sequences of 

murine PAP with a gap of about 15 0 bp that led to a frame 
shift in reading frame. This clone is most likely a 
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spuriously spliced form of PAP-a clone. Clone W70040 was 
found to be a full-length PAP-0 clone, and clone N75714 
was found to be a partial PAP-7 clone with an open 
reading frame homologous to the region from aal8 to the 
5 C- terminus of murine PAP. 

To assemble a full-length functional PAP-or clone, 
synthetic oligonucleotides o_papalF, 5 ' -ggcatggtAC 
CATGTTTGAC AAGACGCGGC-3 * (SEQ ID NO : 18 ) , based on the N- 
terminal region of PAP-a and o_papalR, 5 ' - CATATGTAGT 

10 ATTCAATGTA ACC-3' (SEQ ID NO:19), based on a region 

downstream of a Pst I site complementary to the coding 
strand of PAP-a were used to amplify the N- terminal 
coding region of PAP-a from a human lung cDNA library 
(Life Technologies, Inc., Gaithersburg , MD) . The 450 bp 

15 Acc65 I - Pst I fragment generated was inserted into a 

Acc65 I / Pst I vector from pBluescript (II) SK(-) 
(Stratagene, San Diego, CA) for further analysis. DNA 
sequence analysis of the subclones obtained revealed at 
least two different classes of clones with sequences that 

20 diverged at the putative exon of interest, suggesting the 

presence of two alternatively spliced forms of PAP-a. 
These two alternatively spliced forms of PAP-a ' are 
designated as PAP-al and PAP-a2 here. Each of the. 
individual 450 bp Acc65 I - Pst I fragment generated by 

2 5 PCR was combined with the 810 bp Pst I - Not I fragment 

derived from clone H17855 for ligation into a Acc65 I / 
Not I mammalian expression vector derived from pCE2 for 
the generation of expression plasmids for PAP-al and PAP- 
a2. The plasmid pCE2 was derived from pREP7b (Leung, et 

30 al., Proc. Natl. Acad. Sci . USA, 92: 4813-4817, 1995) 

with the RSV promoter region replaced by the CMV enhancer 
and the elongation factor- la (EF-la) promoter and intron. 
The CMV enhancer of the pCE2 vector was constructed from 
a 380 bp Xba I-Sph I fragment produced by PCR from pCEP4 

35 (Invitrogen, San Diego, CA) using the primers 5'- 

GGCTCTAGAT ATTAATAGTA ATCAATTAC - 3 1 (SEQ ID NO: 14) and 5 1 - 
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CCTCACGCAT GCACCATGGT AATAGC-3 1 (SEQ ID NO: 15) . The EF- 
lor promoter and intron (Uetsuki, et al . , J. Biol. Chem. , 
264: 5791-5798, 1989) was constructed from a 1200 bp Sph 
I-Asp718 I fragment produced by PCR from human genomic 
5 DNA using the primers 5 1 -GGTGCATGCG TGAGGCTCCG GTGC-3 ■ 

(SEQ ID NO: 16) and 5 ' -GTAGTTTTCA CGGTACCTGA AATGGAAG-3 1 
(SEQ ID NO: 17) . These 2 fragments were ligated into a 
Xba I/Asp718 I digested vector derived from pREP7b to 
generate pCE2 . 

0 The DNA sequence determined from clone N75714 was 

used as a probe to search for clones with overlapping 
sequences in the GenBank database. Clone Z43618 was 
found to contain an additional 5 '-sequence with a 
potential ATG initiation codon. To assemble a full- 
15 length PAP-Y clone, synthetic oligonucleotides o_papglF, 

5 ' -tgatggctag cATGCAGAGA AGATGGGTCT TCGTGCTGCT CGACGTG-3 ' 
(SEQ ID NO: 20) , based on the N- terminal region of PAP-7 
and o_papglR, 5 ' -AGTGCGGGAT CCCATAAGTG GTTG-3 ' , (SEQ ID 
NO: 21) based on a region complementary to the coding 
20 strand of PAP-7 just downstream of its stop codon were 

used to generate the full-length coding region of PAP-7 
by PCR using the clone N75714 as template. The 820 bp 
Nhe I - BamH I fragment obtained was then ligated into a 
Nhe I / BamH I mammalian expression vector derived from 
2 5 pCE2. 

Figures 1, 2, 3 and 4 show the translated DNA 
sequences of the putative human cDNA clones for PAP-al, 
a2, /3 and 7, (SEQ ID NOS : 1 , 3, 5 and 7) respectively. 
The designated ATG initiation site for translation of 
30 each cDNA clone fulfills the requirement for an adequate 

initiation site according to Kozak (Kozak, Critical Rev. 
Biochem. Mol . Biol. 27:385-402, 1992). 

The amino acid sequence of each open reading frame 
(Figures 1, 2, 3 and 4 (SEQ ID N0S:2, 4, 6 and 8)) was 
35 used as the query sequence to search for homologous 

sequences in protein databases . Search of the Genbank 
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database from the National Center for Biotechnology 
Information (NCBI) using the blastp program showed that 
these proteins are most homologous to the murine PAP 
sequence (Kai et al., J. Biol. Chem. 271: 18931-18938, 
1996) , and a rat endoplasmic reticulum resident 
transmembrane protein of unknown function, Dri 42, whose 
expression is up-regulated during epithelial 
differentiation (Barila et al . , J. Biol. Chem. 271: 
29928-29936, 1996) . 

Example 2 

Activation of PAP-fi Transcription by ILl-ff 

It is possible that activation of PAP-/? expression 
can counter-balance the inflammatory response from IL-l/? 
stimulation through degradation of the excess amount of 
PA in cells. To determine whether IL1-/3, an inflammatory 
cytokine, would activate the transcription of PAP mRNAs, 
Northern analysis of PAP-/? mRNA levels (Fig. 6) was 
performed in human endothelial ECV304 cells at various 
times after IL-l/? stimulation. Figure 6 shows that PAP-/? 
mRNA expression was induced after incubation of ECV3 04 
cells with IL-1/3 after at least 6 hours, suggesting that 
PAP-j? is a late-response gene to IL-l/? stimulation. This 
indicates that human PAP may act to reduce IL-l/? induced 
inflammation by degrading excess PA in cells. 

Example 3 

PAP-al and PAP-a2 Dephosphorvlation of PA to DAG 

The expression of PAP-cel and PAP-c*2 cDNA was found 
to increase PA dephosphorylation in mammalian cells. 
The expression plasmids for PAP-al, PAP-a2 and the 
control vector were transiently transfected into 2 93-EBNA 
5 (EB293) cells (Invitrogen, San Diego, CA) using the 

lipofectant DOTAP (Boehrihger Mannheim, Indianapolis, 
IN) . PAP activities were followed by TLC analysis based 
on the conversion of [C 14 ] PA (DuPont NEN, Boston, MA) to 
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[C ,4 ]DAG using membrane fractions isolated from the 
various cell extracts. Figure 7 shows membrane fractions 
derived from cells transfected with either the PAP-al 
(lanes 6 and 7) or PAP-a2 (lanes 8 and 9) produced more 
[C ,4 ]DAG those from untransf ected cells (lanes 2 and 3) or 
from cells transfected with the control pCE2 vector 
(lanes 4 and 5) . In this particular chromatography 
system, DAG can be resolved into two bands, possibly due- 
to heterogeneity in the acyl-chains. It appears that 
PAP-al and PAP-a2 preferentially dephosphorylate 
different species of PA as evidenced by the change in 
relative intensity of the two DAG bands (lanes 6 to 9) . 

Example 4 

Differential Expression o f PAP-a mRNA in 
fiplseted Tumor Versu s Normal Tissues 

The possibility that PAP-a expression can degrade the 
excess amount of PA in cells suggests that PAP-a may be 
• down-regulated in tumor cells when compared to normal 
20 cells, as tumor cells tend to be more inflammatory due to 

a possibly higher level of PA when compared to normal' or 
resting cells. To test this hypothesis, Northern 
analysis using PAP-a (1 and 2) cDNA probe was performed on 
RNA blots derived from various matching pairs of tumor 
25 and normal tissues (Invitrogen, Carlsbad, CA) . Figure 8 

shows the expression levels of PAP-a mRNA are 
substantially higher in five out of eight of the normal 
tissues examined; namely, colon, rectal, breast, 
fallopian tube, and ovarian tissues when compared to the 
30 corresponding tumor tissues. 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 



(i) APPLICANT: LEUNG, David W. 

TOMPKINS , Christopher K. 

(ii) TITLE OF INVENTION: HUMAN PHOSPHATIDIC ACID PHOSPHATASE 
(iii) NUMBER OF SEQUENCES : 21 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Foley & Lardner 

(B) STREET: 3000 K Street, N.W., Suite 500 

(C) CITY: Washington 

(D) STATE: D.C. 

(E) COUNTRY: USA 

(F) ZIP: 20007-5109 



(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patent In Release #1.0, Version #1.3 0 



(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: US 08/842,827 

(B) FILING DATE: 17-APR-1997 

(C) CLASSIFICATION: 



(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: BENT, Stephen A. 

(B) REGISTRATION NUMBER: 29,768 

(C) REFERENCE /DOCKET NUMBER: 77319/125 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: {202)672-5300 

(B) TELEFAX: (202)672-53 99 

(C) TELEX: 904136 



(2) INFORMATION FOR SEQ ID NO : 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1563 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 



(ix) FEATURE: 

(A) . NAME /KEY: CDS 

(B) LOCATION: 342 . . 1193 

(ix) FEATURE : 

(A) NAME /KEY : mat_peptide 

(B) LOCATION: 342.. 1193, 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 
CCTGTGGGAG AGAGCGCCGG GATCCGGACG GGGTAGCAAC CGGGGCAGGC CGTGCCGGCT 60 
GAGGAGGTCC TGAGGCTACA GAGCTGCCGC GGCTGGCACA CGAGCGCCTC GGCACTAACC 120 
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GAGTGTTCGC GGGGGCTGTG AGGGGAGGGC CCCGGGCGCC ATTGCTGGCG GTGGGAGCGC 180 

CGCCCGGTCT CAGCCCGCCC TCGGCTgVtC TCCTCCTCCG GCTGGGAGGG GCCGTATCTC 240 

GGGGCCGTCG CCAGCCCCGG CCCGGGCTCG ATAATCAAGG GCCTCGGCCG TCGTCCCGCA 3 00 

CCTCATTCCA TCGCCCTTGC CGGGCAGCCC GGGCAGAGAC C ATG TTT GAC AAG 3 53 

Met Phe Asp Lys 
X 

ACG CGG CTG CCG TAC GTG GCC CTC GAT GTG CTC TGC GTG TTG CTG GCT 401 
Thr Arg Leu Pro Tyr Val Ala Leu Asp Val Leu Cys Val Leu Leu Ala 
5 10 15 20 

GGA TTG CCT TTT GCA ATT CTT ACT TCA AGG CAT ACC CCC TTC CAA CGA 449 
Glv Leu Pro Phe Ala lie Leu Thr Ser Arg His Thr Pro Phe Gin Arg 
25 30 35 

GGA GTA TTC TGT AAT GAT GAG TCC ATC AAG TAC CCT TAC AAA GAA GAC 497 
Gly Val Phe Cys Asn Asp Glu Ser lie Lys Tyr Pro Tyr Lys Glu Asp 
40 45 50 

ACC ATA CCT TAT GCG TTA TTA GGT GGA ATA ATC ATT CCA TTC AGT ATT 545 
Thr lie Pro Tyr Ala Leu Leu Gly Gly lie lie lie Pro Phe Ser lie 
55 60 65 

ATC GTT ATT ATT CTT GGA GAA ACC CTG TCT GTT TAC TGT AAC CTT TTG 5 93 

lie Val lie lie Leu Gly Glu Thr Leu Ser Val Tyr Cys Asn Leu Leu 
70 ~ 75 80 

CAC TCA AAT TCC TTT ATC AGG AAT AAC TAC ATA GCC ACT ATT TAC AAA 641 
His Ser Asn Ser Phe He Arg Asn Asn Tyr He Ala Thr He Tyr Lys 
85 90 95 100 

GCC ATT GGA ACC TTT TTA TTT GGT GCA GCT GCT AGT CAG TCC CTG ACT 689 
Ala He Gly Thr Phe Leu Phe Gly Ala Ala Ala Ser Gin Ser Leu Thr 
105 HO 115 

GAC ATT GCC AAG TAT TCA ATA GGC AGA CTG CGG CCT CAC TTC TTG GAT 73 7 

Asp He Ala Lys Tyr Ser He Gly Arg Leu Arg Pro His Phe Leu Asp 
120 125 130 

GTT TGT GAT CCA GAT TGG TCA AAA ATC AAC TGC AGC GAT GGT TAC ATT 785 
Val Cys Asp Pro Asp Trp Ser Lys He Asn Cys Ser Asp Gly Tyr He 
135 140 145 

GAA TAC TAC ATA TGT CGA GGG AAT GCA GAA AGA GTT AAG GAA GGC AGG 833 
Glu Tyr Tyr He Cys Arg Gly Asn Ala Glu Arg Val Lys Glu Gly Arg 
150 * 155 160 

TTG TCC TTC TAT TCA GGC CAC TCT TCG TTT TCC ATG TAC TGC ATG CTG 8 81 

Leu Ser Phe Tyr Ser Gly His Ser Ser Phe Ser Met Tyr Cys Met Leu 
165 * 170 175 180 

TTT GTG GCA CTT TAT CTT CAA GCC AGG ATG AAG GGA GAC TGG GCA AGA 929 
Phe Val Ala Leu Tyr Leu Gin Ala Arg Met Lys Gly Asp Trp Ala Arg 
185 190 195 

CTC TTA CGC CCC ACA CTG CAA TTT GGT CTT GTT GCC GTA TCC ATT TAT 977 
Leu Leu Arg Pro Thr Leu Gin Phe Gly Leu Val Ala Val Ser He Tyr 
200 205 210 

GTG GGC CTT TCT CGA GTT TCT GAT TAT AAA CAC CAC TGG AGC GAT GTG 1025 
Val Gly Leu Ser Arg Val Ser Asp Tyr Lys His His Trp Ser Asp Val 
215 220 225 
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TTG ACT GGA CTC ATT CAG GGA GCT CTG GTT GCA ATA TTA GTT GCT GTA 1073 
Leu Thr Gly Leu lie Gin Gly Ala Leu Val Ala He Leu Val Ala Val 
230 235 240 

TAT GTA TCG GAT TTC TTC AAA GAA AGA ACT TCT TTT AAA GAA AGA AAA 1121 
Tyr Val Ser Asp Phe Phe Lys Glu Arg Thr Ser Phe Lys Glu Arg Lys 
245 250 255 " 260 

GAG GAG GAC TCT CAT ACA ACT CTG CAT GAA ACA CCA ACA ACT GGG AAT 1169 
Glu Glu Asp Ser His Thr Thr Leu His Glu Thr Pro Thr Thr Gly Asn 
265 270 275 

CAC TAT CCG AGC AAT CAC CAG CCT TGAAAGGCAG CAGGGTGCCC AGGTGAAGCT 1223 
His Tyr Pro Ser Asn His Gin Pro 
280 

GGCCTGTTTT CTAAAGGAAA ATGATTGCCA CAAGGCAAGA GGATGCATCT TTCTTCCTGG 1283 

TGTACAAGCC TTTAAAGACT TCTGCTGCTG ATATGCCTCT TGGATGCACA CTTTGTGTGT 1343 

ACATAGTTAC CTTTAACTCA GTGGTTATCT AATAGCTCTA AACTCATTAA AAAAACTCCA 14 03 

AGCCTTCCAC CAAAACAGTG CCCCACCTGT ATACATTTTT ATTAAAAAAA TGTAATGCTT 1463 

ATGTATAAAC ATGTATGTAA TATGCTTTCT ATGAATGATG TTTGATTTAA ATATAATACA 1523 

TATTAAAATG TATGGGAGAA CCAAAAAAAA AAAAAAAAAA 1563 

(2) INFORMATION FOR SEQ ID NO : 2 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 284 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 

Met Phe Asp Lys Thr Arg Leu Pro Tyr Val Ala Leu Asp Val Leu Cys 
15 10 15 

Val Leu Leu Ala Gly Leu Pro Phe Ala He Leu Thr Ser Arg His Thr 
20 25 30 

Pro Phe Gin Arg Gly Val Phe Cys Asn Asp Glu Ser He Lys Tyr Pro 
35 40 . 45 

Tyr Lys Glu Asp Thr He Pro Tyr Ala Leu Leu Gly Gly He He He 
50 55 60 

Pro Phe Ser He He Val He lie Leu Gly Glu Thr Leu Ser Val Tyr 
65 70 75 80 

Cys Asn Leu Leu His Ser Asn Ser Phe lie Arg Asn Asn Tyr He Ala 
85 90 95 

Thr He Tyr Lys Ala lie Gly Thr Phe Leu Phe Gly Ala Ala Ala Ser 
100 105 110 

Gin Ser Leu Thr Asp He Ala Lys Tyr Ser He Gly Arg Leu Arg Pro 
115 120 125 

His Phe Leu Asp 1 Val Cys Asp Pro Asp Trp Ser Lys He Asn Cys Ser 
130 135 140 
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Asp Gly Tyr He Glu Tyr Tyr He Cys Arg Gly Asn Ala Glu Arg Val 



145 



Lys Glu Gly Arg Leu Ser Phe Tyr Ser Gly His Ser Ser Phe Ser Met 

1 165 170 



Tyr Cys Met Leu Phe Val Ala Leu Tyr Leu Gin Ala Arg Met Lys Gly 



1B0 



Asp Trp Ala Arg Leu Leu Arg Pro Thr Leu Gin Phe Gly Leu Val Ala 

val Ser He Tyr Val Gly Leu Ser Arg Val Ser Asp Tyr Lys His His 

210 215 220 

Trp Ser Asp Val Leu Thr Gly Leu He Gin Gly Ala Leu Val Ala He 
225 230 

Leu Val Ala Val Tyr Val Ser Asp Phe Phe Lys Glu Arg Thr Ser Phe 
245 250 25 

Lys Glu Arg Lys Glu Glu Asp Ser His Thr Thr Leu His Glu Thr Pro 
260 265 270 

Thr Thr Gly Asn His Tyr Pro Ser Asn His Gin Pro 
275 280 

(2) INFORMATION FOR SEQ ID NO : 3 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1566 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 342.. 1196 

(ix) FEATURE: 

(A) NAME /KEY : matjeptide 

(B) LOCATION: 342.. 1196 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

CCTGTGGGAG AGAGCGCCGG GATCCGGACG GGGTAGCAAC CGGGGCAGGC CGTGCCGGCT 

GAGGAGGTCC TGAGGCTACA GAGCTGCCGC GGCTGGCACA CGAGCGCCTC GGCACTAACC 

GAGTGTTCGC GGGGGCTGTG AGGGGAGGGC CCCGGGCGCC ATTGCTGGCG GTGGGAGCGC 

CGCCCGGTCT CAGCCCGCCC TCGGCTGCTC TCCTCCTCCG GCTGGGAGGG GCCGTATCTC 

GGGGCCGTCG CCAGCCCCGG CCCGGGCTCG ATAATCAAGG GCCTCGGCCG TCGTCCCGCA 

rrTrATTCCA TCGCCCTTGC CGGGCAGCCC GGGCAGAGAC C ATG TTT GAC AAG 

Met Phe Asp Lys 
1 

ACG CGG CTG CCG TAC GTG GCC CTC GAT GTG CTC TGC GTG TTG CTG GCT 
Thr Arq Leu Pro Tyr Val Ala Leu Asp Val Leu Cys Val Leu Leu Ala 
5 10 15 20 



60 
120 
180 
240 
300 
353 

401 
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TCC ATG CCT ATG GCT GTT CTA AAA TTG GGC CAA ATA TAT CCA TTT CAG 44 9 

Ser Met Pro Met Ala Val Leu Lys Leu Gly Gin lie Tyr Pro Phe Gin 
25 30 35 

AGA GGC TTT TTC TGT AAA GAC AAC AGC ATC AAC TAT CCG TAC CAT GAC 4 97 

Arg Gly Phe Phe Cys Lys Asp Asn Ser lie Asn Tyr Pro Tyr His Asp 
40 45 " 50 

AGT ACC GCC GCA TCC ACT GTC CTC ATC CTA GTG GGG GTT GGC TTG CCC 545 
Ser Thr Ala Ala Ser Thr Val Leu lie Leu Val Gly Val Gly Leu Pro 
55 60 65 

GTT TCC TCT ATT ATT CTT GGA GAA ACC CTG TCT GTT TAC TGT AAC CTT 593 
Val Ser Ser lie lie Leu Gly Glu Thr Leu Ser Val Tyr Cys Asn Leu 
70 75 80 

TTG CAC TCA AAT TCC TTT ATC AGT AAT AAC TAC ATA GCC ACT ATT TAC 641 
Leu His Ser Asn Ser Phe lie Ser Asn Asn Tyr lie Ala Thr lie Tyr 
85 90 95 100 

AAA GCC ATT GGA ACC TTT TTA TTT GGT GCA GCT GCT AGT CAG TCC CTG 6 89 

Lys Ala lie Gly Thr Phe Leu Phe Gly Ala Ala Ala Ser Gin Ser Leu 
105 ' 110 115 

ACT GAC ATT GCC AAG TAT TCA ATA GGC AGA CTG CGG CCT CAC TTC TTG 73 7 

Thr Asp lie Ala Lys Tyr Ser lie Gly Arg Leu Arg Pro His Phe Leu 
120 125 130 

GAT GTT TGT GAT CCA GAT TGG TCA AAA ATC AAC TGC AGC GAT GGT TAC 78 5 

Asp Val Cys Asp Pro Asp Trp Ser Lys lie Asn Cys Ser Asp Gly Tyr 
135 140 ~ 145 

ATT GAA TAC TAC ATA TGT CGA GGG AAT GCA GAA AGA GTT AAG GAA GGC 833 
lie Glu Tyr Tyr lie Cys Arg Gly Asn Ala Glu Arg Val Lys Glu Gly 
150 155 160 

AGG TTG TCC TTC TAT TCA GGC CAC TCT TCG TTT TCC ATG TAC. TGC ATG 881 
Arg Leu Ser Phe Tyr Ser Gly His Ser Ser Phe Ser Met Tyr Cys Met 
165 170 175 180 

CTG TTT GTG GCA CTT TAT CTT CAA GCC AGG ATG AAG . GGA GAC . TGG GCA 929 
Leu Phe Val Ala Leu Tyr Leu Gin Ala Arg Met Lys Gly Asp Trp Ala 
185 190 195 

AGA CTC TTA CGC CCC ACA CTG CAA TTT GGT CTT GTT GCC GTA TCC ATT 977 
Arg Leu Leu Arg Pro Thr Leu Gin Phe Gly Leu Val Ala Val Ser lie 
200 205 210 

TAT GTG GGC CTT TCT CGA GTT TCT GAT TAT AAA CAC CAC TGG AGC GAT 1025 
Tyr Val Gly Leu Ser Arg Val Ser Asp Tyr Lys His His Trp Ser Asp 
215 220 225 

GTG TTG ACT GGA CTC ATT CAG GGA GCT CTG GTT GCA ATA TTA GTT GCT 1073 
Val Leu Thr Gly Leu lie Gin Gly Ala Leu Val Ala lie Leu Val Ala 
230 " 235 * 240 

GTA TAT GTA TCG GAT TTC TTC AAA GAA AGA ACT TCT TTT AAA GAA AGA 1121 
Val Tyr Val Ser Asp Phe Phe Lys Glu Arg Thr Ser Phe Lys Glu Arg 
245 250 255 260 

AAA GAG GAG GAC TCT CAT ACA ACT CTG CAT GAA ACA CCA ACA ACT GGG 116 9 

Lys Glu Glu Asp Ser His Thr Thr Leu His Glu Thr Pro Thr Thr Gly 
265 270 275 

AAT CAC TAT CCG AGC AAT CAC CAG CCT TGAAAGGCAG CAGGGTGCCC 1216 
Asn His Tyr Pro Ser Asn His Gin Pro 
280 285 
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AGGTGAAGCT GGCCTGTTTT CTAAAGGAAA ATGATTGCCA CAAGGCAAGA GGATGCATCT 
TTCTTCCTGG TGTACAAGCC TTTAAAGACT TCTGCTGCTG ATATGCCTCT TGGATGCACA 
CTTTGTGTGT ACATAGTTAC CTTTAACTCA GTGGTTATCT AATAGCTCTA AACTCATTAA 
AAAAACTCCA AGCCTTCCAC CAAAACAGTG CCCCACCTGT ATACATTTTT ATTAAAAAAA 
TGTAATG CTT ATGTATAAAC ATGTATGTAA T ATG CTTTCT ATGAATGATG TTTGATTTAA 
ATATAATACA TATTAAAATG TATGGGAGAA CCAAAAAAAA AAAAAAAAAA 

(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 285 amino acids 

(B) TYPE: amino acid 
( D ) TOPOLOGY : 1 ine ar 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:4: 

Met Phe Asp Lys Thr Arg Leu Pro Tyr Val Ala Leu Asp Val Leu Cys 
x 5 10 « 

Val Leu Leu Ala Ser Met Pro Met Ala Val Leu Lys Leu Gly Gin He 
20 25 30 

Tyr Pro Phe Gin Arg Gly Phe Phe Cys Lys Asp Asn Ser He Asn Tyr 
35 40 45 

Pro Tyr His Asp Ser Thr Ala Ala Ser Thr Val Leu He Leu Val Gly 
50 ~ 55 60 

Val Gly Leu Pro Val Ser Ser He He Leu Gly Glu Thr Leu Ser Val 
65 70 75 80 

Tvr Cvs Asn Leu Leu His Ser Asn Ser Phe He Ser Asn Asn Tyr He 
Y y 85 90 95 

Ala Thr He Tyr Lys Ala He Gly Thr Phe Leu Phe Gly Ala Ala Ala 
100 105 HO 

Ser Gin Ser Leu Thr Aso He Ala Lys Tyr Ser He Gly Arg Leu Arg 
115 120 125 

Pro His Phe Leu Asp Val Cys Asp Pro Asp Trp Ser Lys He Asn Cys 
13 0 135 140 

Ser Asp Gly Tyr He Glu Tyr Tyr He Cys Arg Gly Asn Ala Glu Arg 
145 150 155 160 

Val Lys Glu Gly Arg Leu Ser Phe Tyr Ser Gly His Ser Ser Phe Ser 
165 170 175 

Met Tyr Cys Met Leu Phe Val Ala Leu Tyr Leu Gin Ala Arg Met Lys 
180 185 190 

Gly Asp Trp Ala Arg Leu Leu Arg Pro Thr Leu Gin Phe Gly Leu Val 
195 200 205 

Ala Val Ser He Tyr Val Gly Leu Ser Arg Val Ser Asp Tyr Lys His 
210 ' 215 220 



1276 

1336 

1396 

1456 

1516 

1566 
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His Trp Ser Asp Val Leu Thr Glv 
225 230 

He Leu Val 



Phe Lys Glu Arg Lys Glu Glu Aso 
260 

Pro Thr Thr Gly Asn His Tyr Pre 
275 28C 

(2) INFORMATION FOR SEQ ID NO : 5 : 



Leu He Gin Gly Ala Leu Val Ala 
235 240 



Ser His Thr Thr Leu His Glu Thr 
265 270 

Ser Asn His Gin Pro 
285 



Ala Val Tyr Val Ser Asp Phe Phe Lys Glu Arg Thr Ser 
245 250 255 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1362 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ix) FEATURE : 

(A) NAME /KEY : CDS 

(B) LOCATION: 294 1226 

(ix) FEATURE : 

(A) NAME /KEY : mat_peccide 

(B) LOCATION: 294 . . 1226 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 : 

GGCGCAGCTC TGCAAAAGTT TCTGCTCGGG ATCTGGCTCT CTTCCCCTTG GACTTTAGAA 60 

CGATTTAGGG TTGACAGAGG AAAGCAGAGG CGCGCAGGAG GAGCAGAAAA CACCACCTTC 120 

TGCAGTTGGA GGCAGGCAGC CCCGGCTGCA CTCTAGCCGC CGCGCCCGGA GCCGGGGCCG 180 

ACCCGCCACT ATCCGCAGCA GCCTCGGCCA GGAGGCGACC CGGGCGCCTG GGTGTGTGGC 240 

TGCTGTTGCG GGACGTCTTC GCGGGGCGGG AGGCTCGCGC CGCAGCCAGC GCC ATG 296 

Met 
1 

CAA AAC TAC AAG TAC GAC AAA GCG ATC GTe CCG GAG AGC AAG AAC GGC 344 
Gin Asn Tyr Lys Tyr Asp Lys Ala He Val Pro Glu Ser Lys Asn Gly 
5 10 15 

GGC AGC CCG GCG CTC AAC AAC AAC CCG AGG AGG AGC GGC AGC AAG CGG 3 92 

Gly Ser Pro Ala Leu Asn Asn Asn Pro Arg Arg Ser Gly Ser Lys Arg 
20 25 30 

GTG CTG CTC ATC TGC CTC GAC CTC TTC TGC CTC TTC ATG GCG GGC CTC 440 
Val Leu Leu He Cys Leu Asp Leu Phe Cys Leu Phe Met Ala Gly Leu 
35 40 45 

CCC TTC CTC ATC ATC GAG ACA AGC ACC ATC AAG CCT TAC CAC CGA GGG 4 88 

Pro Phe Leu He He Glu Thr Ser Thr He Lys Pro Tyr His Arg Gly 
50 55 60 65 

TTT TAC TGC AAT GAT GAG AGC ATC AAG TAC CCA CTG AAA ACT GGT GAG 536 
Phe Tyr Cys Asn Asp Glu Ser He Lys Tyr Pro 'Leu Lys Thr Gly Glu 
70 ■ 75 80 
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776 



824 



872 



APR A~A AAT GAC GCT GTG CTC TGT GCC GTG GGG ATC GTC ATT GCC ATC 584 
tS t£ JE Sp Ala Val Leu Cys Ala Val Gly He Val lie Ala lie 
85 90 

CTC GC3 ATC ATC ACG GGG GAA TTC TAC CGG ATC TAT TAC CTG AAG AAG 632 
2u lit 111 lie Thr Gly Glu Phe Tyr Arg He Tyr Tyr Leu Lys Lys 
100 105 110 

TCG C~-3 TCG ACG ATT CAG AAC CCC TAC GTG GCA GCA CTC TAT AAG CAA 680 
Ser Arg Ser Thr He Gin Asn Pro Tyr Val Ala Ala Leu Tyr Lys Gin 
^-i J 120 125 

GTG GGC TGC TTC CTC TTT GGC TGT GCC ATC AGC CAG TCT TTC ACA GAC 728 
SI? G?v Cys III Leu Phe Gly Cys Ala lie Ser Gin Ser Phe Thr Asp 
130 ' I 35 140 

ATT GCC AAA GTG TCC ATA GGG CGC CTG CGT CCT CAC TTC TTG ACT GTC 
lie 111 Lys Val Ser He Gly Arg Leu Arg Pro His Phe Leu Ser Val 

TCC AAC CCT GAT TTC AGC CAG ATC AAC TGC TCT GAA GGC TAC ATT CAG 
Cys Pro Asp Phe Ser Gin He Asn Cys Ser Glu Gly Tyr He Gin 

165 170 175 

AAC Tl" AGA TGC AGA GGT GAT GAC AGC AAA GTC CAG GAA GCC AGG AAG 
Asn T~ Arg Cys Arg Gly Asp Asp Ser Lys Val Gin Glu Ala Arg Lys 
180 ~ 185 190 

^ TTC TCT GGC C AT GCC TCC TTC TCC ATG TAC ACT ATG CTG TAT 920 
Ser P*° Phe Ser Gly His Ala Ser Phe Ser Met Tyr Thr Met Leu Tyr 
195 * 200 205 

TTG GTG CTA TAC CTG CAG GCC CGC TTC ACT TGG CGA GGA GCC CGC CTG 968 
Leu Val Leu Tyr Leu Gin Ala Arg Phe Thr Trp Arg Gly Ala Arg Leu 
210 215 220 225 

CTC CGG CCC CTC CTG CAG TTC ACC TTG ATC ATG ATG GCC TTC TAC ACG 1016 
Leu Ar= Pro Leu Leu Gin Phe Thr Leu He Met Met Ala Phe Tyr Thr 
230 235 240 

GGA • C3 TCT CGC GTA TCA GAC CAC AAG CAC CAT CCC AGT GAT GTT CTG 1064 
Glv Leu Ser Arg Val Ser Asp His Lys His His Pro Ser Asp Val Leu 
y 245 250 255 

GCA GGA TTT GCT CAA GGA GCC CTG GTG GCC TGC TGC ATA GTT TTC TTC 1112 
Ala Glv Phe Ala Gin Gly Ala Leu Val Ala Cys Cys He Val Phe Phe 
* 260 265 270 

GTG TC~ GAC CTC TTC AAG ACT AAG ACG ACG CTC TCC CTG CCT GCC CCT 1160 
Val Se" Asp Leu Phe Lys Thr Lys Thr Thr Leu Ser Leu Pro Ala Pro 
275 280 285 

GCT ATC CGG AAG GAA ATC CTT TCA CCT GTG GAC ATT ATT GAC AGG AAC 1208 
Ala I 1 o Arg Lys Glu He Leu Ser Pro Val Asp He He Asp Arg Asn 
290 ' ~ * 295 300 305 

AAT CAC CAC AAC ATG ATG TAGGTGCCAC CCACCTCCTG AGCTGTTTTT 1256 
Asn His His Asn Met Met 
310 

GTAAAATGAC TGCTGACAGC AAGTTCTTGC TGCTCTCCAA TCTCATCAGA CAGTAGAATG 1316 
TAGGGAAAAA CTTTTGCCCG ACTGATTTTT AAAAAAAAAA AAAAAA I 362 
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(2) INFORMATION FOR SEQ ID NO : 6 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 311 amino acids 

(B) TYPE: amino acid 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 : 

Met Gin Asn Tyr Lvs Tyr Asp Lys Ala lie Val Pro Glu Ser Lys Asn 
1 * 5 10 15 

Gly Gly Ser Pro Ala Leu Asn Asn Asn Pro Arg Arg Ser Gly Ser Lys 
20 25 30 

Arg Val Leu Leu lie Cys Leu Asp Leu Phe Cys Leu Phe Met Ala Gly 
35 40 45 

Leu Pro Phe Leu lie lie Glu Thr Ser Thr lie Lys Pro Tyr His Arg 
50 55 60 

Gly Phe Tyr Cys Asn Asp Glu Ser lie Lys Tyr Pro Leu Lys Thr Gly 
65 ' 70 75 80 

Glu Thr He Asn Asp Ala Val Leu Cys Ala Val Gly He Val He Ala 
85 90 95 

He Leu Ala He He Thr Gly Glu Phe Tyr Arg He Tyr Tyr Leu Lys 
100 105 110 

Lys Ser Arg Ser Thr He Gin Asn Pro Tyr Val Ala Ala Leu Tyr Lys 
115 120 125 

Gin Val Gly Cys Phe Leu Phe Gly Cys Ala He Ser Gin Ser Phe Thr 
130 * 135 140 

Asp He Ala Lys Val Ser He Gly Arg Leu Arg Pro His Phe Leu Ser 
145 150 155 160 

Val Cys Asn Pro Asd Phe Ser Gin He Asn Cys Ser Glu Gly Tyr He 
165 170 175 

Gin Asn Tyr Arg Cvs Arg Gly Asp Asp Ser Lys Val Gin Glu Ala Arg 
180 185 190 

Lys Ser Phe Phe Ser Gly His Ala Ser Phe Ser Met Tyr Thr. Met Leu 
195 200 • 205 

Tyr Leu Val Leu Tyr Leu Gin Ala Arg Phe Thr Trp Arg Gly Ala Arg 
210 215 220 

Leu Leu Arg Pro Leu Leu Gin Phe Thr Leu He Met Met Ala Phe Tyr 
225 230 235 240 

Thr Gly Leu Ser Arg Val Ser Asp His Lys His His Pro Ser Asp Val 
245 250 255 

Leu Ala Gly Phe Ala Gin Gly Ala Leu Val Ala Cys Cys He Val Phe 
260 265 270 

Phe Val Ser Asp Leu Phe Lys Thr Lys Thr Thr Leu Ser Leu Pro Ala 
275 280 285 

Pro Ala He Arg Lvs Glu He Leu Ser Pro Val Asp He He Asp Arg 
290 ~ 295 300 
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Asn Asn His His Asn Met Met 
305 310 

(2) INFORMATION FOR SEQ ID NO : 7 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1232 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY : linear 



(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 4.. 833 

1 (ix) FEATURE: 

(A) NAME / KEY : mat_jpeptide 

(B) LOCATION: 4.. 833 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 : 

ACC ATG CAG CGG AGG TGG GTC TTC GTG CTG CTC GAC GTG CTG TGC TTA 48 
Met Gin Arg Arg Trp Val Phe Val Leu Leu Asp Val Leu Cys Leu 
1 5 10 15 

CTG GTC GCC TCC CTG CCC TTC GCT ATC CTG ACG CTG GTG AAC GCC CCG 96 
Leu Val Ala Ser Leu Pro Phe Ala lie Leu Thr Leu Val Asn Ala Pro 
20 25 30 

TAC AAG CGA GGA TTT TAC TGC GGG GAT GAC TCC ATC CGG TAC CCC TAC 144 
Tyr Lys Arg Gly Phe Tyr Cys Gly Asp Asp Ser lie Arg Tyr Pro Tyr 
3 5 40 45 

CGT CCA GAT ACC ATC ACC CAC GGG CTC ATG GCT GGG GTC ACC ATC ACG 192 
Arg Pro Asp Thr lie Thr His Gly Leu Met Ala Gly Val Thr lie Thr 
50 55 60 

GCC ACC GTC ATC CTT GTC TCG GCC GGG GAA GCC TAC CTG GTG TAC ACA 240 
Ala Thr Val lie Leu Val Ser Ala Gly Glu Ala Tyr Leu Val Tyr Thr 
65 70 75 

GAC CGG CTC TAT TCT CGC TCG GAC TTC AAC AAC TAC GTG GCT GCT GTA 288 
Asp Arg Leu Tyr Ser Arg Ser Asp Phe Asn Asn Tyr Val Ala Ala Val 
80 ~ 85 90 95 

TAC AAG GTG CTG GGG ACC TTC CTG TTT GGG GCT GCC GTG AGC CAG TCT 336 
Tyr Lys Val Leu Gly Thr Phe Leu Phe Gly Ala Ala Val Ser Gin Ser 
100 105 110 

CTG ACA GAC CTG GCC AAG TAC ATG ATT GGG CGT CTG AAG CCC AAC TTC 384 
Leu Thr Asp Leu Ala Lys Tyr Met lie Gly Arg Leu Lys Pro Asn Phe 
115 120 125 

CTA GCC GTC TGC GAC CCC GAC TGG AGC CGG GTC AAC TGC TCG GTC TAT 432 
Leu Ala Val Cys Asp Pro Aso Trp Ser Arg Val Asn Cys Ser Val Tyr 
13 0 13 5 140 

GTG CAG CTG GAG AAG GTG TGC AGG GGA AAC CCT GCT GAT GTC ACC GAG 480 
Val Gin Leu Glu Lys Val Cys Arg Gly Asn Pro Ala Asp Val Thr Glu 
145 150 155 

GCC AGG TTG TCT TTC TAC TCG GGA CAC TCT TCC TTT GGG ATG TAC TGC 528 
Ala Arg Leu Ser Phe Tyr Ser Gly His Ser Ser Phe Gly Met Tyr Cys 
160 ~ 165 170 175 



BNSDOCID: <WO 9846730A1 J_> 



SUBSTITUTE SHEET (RULE 26) 



WO 98/46730 PCT/US98/07928 

33 



ATG GTG TTC TTG GCG CTG TAT GTG CAG GCA CGA CTC TGT TGG AAG TGG 
Met Val Phe Leu Ala Leu Tyr Val Gin Ala Arg Leu Cys Trp Lys Trp 
180 185 190 



245 250 

CTG AAG GAG GAG GAG CTG GAA CGG AAG CCC AGC CTG TCA CTG ACG TTG 
Leu Lys Glu Glu Glu Leu Glu Arg Lys Pro Ser Leu Ser Leu Thr Leu 
260 265 270 

ACC CTG GGG CGA GGC TG ACCACAACCA CTTATGGGAT ACCCGCACTC 
Thr Leu Gly Arg Gly 
275 

TTCTTCCTGA GGCCGGACCC CGCCCAGGCA GGGAGCTGCT GTGAGTCCAG CTGATGCCCA 
CCCAGGTGGT CCCTCCAGCC TGGTTAGGCA CTGAGGGTTC TGGACGGGCT CCAGGAACCC 
TGGGCTGATG GGAGCAGTGA GCGGTTCCGC TGCCCCCTGC CCTGCACTGG ACCAGGAGTC 
TGGAGATGCC TGGGTAGCCC TCAGCATTTG GAGGGGAACC TGTTC CCGTC GGTCCCCAAA 
TATCCCCTTC TTTTTATGGG GTTAAGGAAG GGACCGAGAG ATCAGATAGT TGCTGTTTTG 
TAAAATGTAA TGTATATGTG GTTTTTAGTA AAATAGGGCA CCTGTTTCAC AAAAAAAAAA 
AAAAAAAAA 

(2) INFORMATION FOR SEQ ID NO : 8 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 276 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 : 

Met Gin Arg Arg Trp Val Phe Val Leu Leu .Asp Val Leu Cys Leu Leu 

1 5 1° 

Val Ala Ser Leu Pro Phe Ala lie Leu Thr Leu Val Asn Ala Pro Tyr 



. Lys Arg Gly Phe Tyr Cys Gly Asp Asp Ser He Arg Tyr Pro Tyr Arg 
35 40 
Pro Asp Thr. He Thr His Gly Leu Met Ala Gly Val Thr lie Thr Ala 



576 



CCA CGG CTG CTG CGA CCC ACA GTC CAG TTC TTC CTG GTG GCC TTT GCC 624 
Ala Arg Leu Leu Arg Pro Thr Val Gin Phe Phe Leu Val Ala Phe Ala 
XSS 200 205 



CTC TAC GTG GGC TAC ACC CGC GTG TCT GAT TAC AAA CAC CAC TGG AGC 672 
Leu Tyr Val Gly Tyr Thr Arg Val Ser Asp Tyr Lys His His Trp Ser 
210 215 220 

GAT GTC CTT GTT GGC CTC CTG CAG GGG GCA CTG GTG GCT GCC CTC ACT 
Asp Val Leu Val Gly Leu Leu Gin Gly Ala Leu Val Ala Ala Leu Thr 
225 230 235 

CTC TGC TAC ATC TCA GAC TTC TTC AAA GCC CGA CCC CCA CAG CAC TGT 
Val Cys Tyr He Ser Asp Phe Phe Lys Ala Arg Pro Pro Gin His Cys 

2 ->jic 250 255 



720 



768 



816 

863 

923 
983 
1043 
1103 
1163 
1223 
1232 
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Thr Val lie Leu Val Ser Ala Gly Glu Ala Tyr Leu Val Tyr Thr Asp 
65 70 75 80 

Arg Leu Tyr Ser Arg Ser Asp Phe Asn Asn Tyr Val Ala Ala Val Tyr 
85 90 95 

Lys Val Leu Gly Thr Phe Leu Phe Gly Ala Ala Val Ser Gin Ser Leu 
100 105 110 

Thr Asp Leu Ala Lys Tyr Met lie Gly Arg Leu Lys Pro Asn Phe Leu 
115 120 125 

Ala Val Cys Asp Pro Asp Trp Ser Arg Val Asn Cys Ser Val Tyr Val 
130 135 140 

Gin Leu Glu Lys Val Cys Arg Gly Asn Pro Ala Asp Val Thr Glu Ala 
145 150 155 160 

Arg Leu Ser Phe Tyr Ser Gly His Ser Ser Phe Gly Mec Tyr Cys Met 
165 170 175 

Val Phe Leu Ala Leu Tyr Val Gin Ala Arg Leu Cys Tro Lys Trp Ala 
180 185 " 190 

Arg Leu Leu Arg Pro Thr Val Gin Phe Phe Leu Val Ala Phe Ala Leu 
195 200 205 

Tyr Val Gly Tyr Thr Arg Val Ser Asp Tyr Lys His His Trp Ser Asp 
210 215 220 

Val Leu Val Gly Leu Leu Gin Gly Ala Leu Val Ala Ala Leu Thr Val 
225 230 235 240 

Cys Tyr He Ser Asp Phe Phe Lys Ala Arg Pro Pro Gin His Cys Leu 
245 250 255 

Lys Glu Glu Glu Leu Glu Arg Lys Pro Ser Leu Ser Leu Thr Leu Thr 
260 265 270 

Leu Gly Arg Gly 
275 

(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 283 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 : 

Met Phe Asp Lys Thr Arg Leu Pro Tyr Val Ala Leu Asp Val He Cys 
1 5 10 ~ is 

Val Leu Leu Ala Gly Leu Pro Phe Ala He Leu Thr Ser Arg His Thr 
20 25 30 

Pro Phe Gin Arg Gly He Phe Cys Asn Asp Asp Ser He Lys Tyr Pro 
35 40 45 

Tyr Lys Glu Asp Thr He Pro Tyr Ala Leu Leu Glv Gly He Val He 
50 55 60* 



SUBSTITUTE SHEET (RULE 26) 



WO 98/46730 PCT/US98/07928 

35 . 

Pro Phe Cys He He Val Met Ser He Gly Glu Ser Leu Ser Val Tyr 
-65 70 " 75 80 

Phe Asn Val Leu His Ser Asn Ser Phe Val Gly Asn Pro Tyr He Ala 
85 90 95 

Thr He Tyr Lys Ala Val Gly Ala Phe Leu Phe Gly Val Ser Ala Ser 
100 105 110 

Gin Ser Leu Thr Asp He Ala Lys Tyr Thr He Gly Ser Leu Arg Pro 
115 120 125 

His Phe Leu Ala He Cys Asn Pro Asp Trp Ser Lys He Asn Cys Ser 
130 135 140 

Asp Gly Tyr He Glu Aso Tyr He Cys Gin Gly Asn Glu Glu Lys Val 
14 5 150 155 160 

Lvs Glu Gly Arg Leu Ser Phe Tyr Ser Gly His Ser Ser Phe Ser Met 
165 170 175 

Tvr cys Met Leu Phe Val Ala Leu Tyr Leu Gin Ala Arg Met Lys Gly 
180 185 190 

Asp Trp Ala Arg Leu Leu Arg Pro Met Leu Gin Phe Gly Leu He Ala 
195 200 205 

Phe Ser He Tyr Val Gly Leu Ser Arg Val Ser Asp Tyr Lys His His 
210 J 215 220 

Trp Ser Asp Val Thr Val Gly Leu He Gin Gly Ala Ala Met Ala He 
225 230 235 240 

Leu Val Ala Leu Tyr Val Ser Asp Phe Phe Lys Asp Thr His Ser Tyr 
245 250 255 

Lvs Glu Arq Lys Glu Glu Asp Pro His Thr Thr Leu His Glu Thr Ala 
y 260 265 270 

Ser Ser Arg Asn Tyr Ser Thr Asn His Glu Pro 
275 280 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 84 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:10: 

Met Phe Asp Lys Thr Arg Leu Pro Tyr Val Ala Leu Asp Val Leu Cys 
1 5 10 15 

Val Leu Leu Ala Gly Leu Pro Phe Ala He Leu Thr Ser Arg His Thr 
20 * 25 30 

Pro Phe Gin Arg Gly Val Phe Cys Asn Asp Glu Ser He Lys Tyr Pro 
35 40 45 

Tvr Lys Glu Asp Thr He Pro Tyr Ala Leu Leu Gly Gly He He He 
50 55 60 
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Pro Phe Ser lie He Val He lie Leu Gly Glu Thr Leu Ser Val Tyr 
65 70 

Cys Asn Leu Leu His Ser Asn Ser Phe lie Arg Asn Asn Tyr lie Ala 

Thr He Tyr Lys Ala He Gly Thr Phe Leu Phe Gly Ala Ala Ala Ser 
100 105 

Gin ser Leu Thr Asp He Ala Lys Tyr Ser He Gly Arg Leu Arg Pro 
115 120 12s 

His Phe Leu Asp Val Cys Asp Pro Asp Trp Ser Lys He Asn Cys Ser 
130 135 1 

Asp Gly Tyr He Glu Tyr Tyr He Cys Arg Gly Asn Ala Glu Arg Val 
£ 4 | 150 155 «° 

Lys Glu Gly Arg Leu Ser Phe Tyr Ser Gly His Ser Ser Phe Ser Met 



165 



Tvr Cys Met Leu Phe Val Ala Leu Tyr Leu Gin Ala Arg Met Lys Gly 
180 185 190 

Asp Trp Ala Arg Leu Leu Arg Pro Thr Leu Gin Phe Gly Leu Val Ala 

Val Ser He Tyr Val Gly Leu Ser Arg Val Ser Asp Tyr Lys His His 

210 215 220 

Trp Ser Asp Val Leu Thr Gly Leu He Gin Gly Ala Leu Val Ala He 
225 230 235 

Leu val Ala Val Tyr Val Ser Asp Phe Phe Lys Glu Arg Thr Ser Phe 
245 250 255 

Lys Glu Arg Lys Glu Glu Asp Ser His Thr Thr Leu His Glu Thr Pro 
260 265 270 

Thr Thr Gly Asn His Tyr Pro Ser Asn His Gin Pro 
275 280 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 285 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

Met Phe Asp Lys Thr Arg Leu Pro Tyr Val Ala Leu Asp Val Leu Cys 
1 5 10 15 

Val Leu Leu Ala Ser Met Pro Met Ala Val Leu Lys Leu Gly Gin He 
20 25 30 

Tvr Pro Phe Gin Arg Gly Phe Phe Cys Lys Asp Asn Ser He Asn Tyr 
35 40 45 

Pro Tyr His Asd Ser Thr Ala Ala Ser Thr Val Leu He Leu Val Gly 
50 ~ 55 60 
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Val Gly Leu Pro Val Ser Ser lie lie Leu Glv Glu Thr Leu Ser Val 
65 70 75 80 

Tyr Cys Asn Leu Leu His Ser Asn Ser Phe lie Arg Asn Asn Tyr lie 
85 90 95 

Ala Thr lie Tyr Lys Ala He Gly Thr Phe Leu Phe Gly Ala Ala Ala 
100 105 110 

Ser Gin Ser Leu Thr Asp He Ala Lys Tyr Ser He Gly Arg Leu Arg 
115 120 125 

Pro His Phe Leu Asp Val Cys Asp Pro Asp Trp Ser Lys He Asn Cys 
130 ~ 135 140 

Ser Asp Gly Tyr He Glu Tyr Tyr He Cys Arg Gly Asn Ala Glu Arg 
145 * " 150 155 160 

Val Lys Glu Gly Arg Leu Ser Phe Tyr Ser Gly His Ser Ser Phe Ser 
165 170 175 

Met Tyr Cys Met Leu Phe Val Ala Leu Tyr Leu Gin Ala Arg Met Lys 
180 185 190 

Gly Asp Trp Ala Arg Leu Leu Arg Pro Thr Leu Gin Phe Gly Leu Val 
195 200 205 

Ala Val Ser He Tyr Val Gly Leu Ser Arg Val Ser Asp Tyr Lys His 
210 215 220 

His Trp Ser Asp Val Leu Thr Gly Leu He Gin Gly Ala Leu Val Ala 
225 ' 230 235 240 

He Leu Val Ala Val Tyr Val Ser Asp Phe Phe Lys Glu Arg Thr Ser 
245 250 255 

Phe Lys Glu Arg Lys Glu Glu Asp Ser His Thr Thr Leu His Glu Thr 
260 265 270 

Pro Thr Thr Gly Asn His Tyr Pro Ser Asn His Gin Pro 
275 280 285 

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 311 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 
{ D ) TOPOLOGY : 1 inear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 12 : 

Met Gin Asn Tyr Lys Tyr Asp Lys Ala He Val Pro Glu Ser Lys Asn 
1 5 10 15 

Glv Gly Ser Pro Ala Leu Asn Asn Asn Pro Arg Arg Ser Gly Ser Lys 
20 25 30 

Arq Val Leu Leu He Cys Leu Asp Leu Phe Cys Leu Phe Met Ala Gly 
35 40 45 

Leu P-o Phe Leu He He Glu Thr Ser Thr He .Lys Pro Tyr His Arg 
50 55 60 
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Gly Phe Tyr Cys Asn Asp Glu Ser lie Lys Tyr Pro Leu Lvs Thr Gly 
6 5 " " 7Q ?5 80 

Glu Thr He Asn Asp Ala Val Leu Cys Ala Val Gly He Val He Ala 
85 90 95 

He Leu Ala He He Thr Gly Glu Phe Tyr Arg He Tyr Tyr Leu Lys 
100 105 110 

Lys Ser Arg Ser Thr He Gin Asn Pro Tyr Val Ala Ala Leu Tyr Lys 
115 120 125 

Gin Val Gly Cys Phe Leu Phe Gly Cys Ala He Ser Gin Ser Phe Thr 
130 ~ 135 140 

Asp He Ala Lys Val Ser He Gly Arg Leu Arg Pro His Phe Leu Ser 
145 150 155 160 

Val Cys Asn Pro Asp Phe Ser Gin He Asn Cys Ser Glu Gly Tyr He 
165 170 175 

Gin Asn Tyr Arg Cys Arg Gly Asp Asp Ser Lys Val Gin Glu Ala Arg 
180 185 190 

Lys Ser Phe Phe Ser Gly His Ala Ser Phe Ser Met Tyr Thr Met Leu 
195 200 205 

Tyr Leu Val Leu Tyr Leu Gin Ala Arg Phe Thr Trp Arg Gly Ala Arg 
210 215 220 

Leu Leu Arg Pro Leu Leu Gin Phe Thr Leu He Met Met Ala Phe Tyr 
225 230 235 240 

Thr Gly Leu Ser Arg Val Ser Asp His Lys His His Pro Ser Asp Val 
245 250 255 

Leu Ala Gly Phe Ala Gin Gly Ala Leu Val Ala Cys Cys He Val Phe 
260 265 270 

Phe Val Ser Asp Leu Phe Lys Thr Lys Thr Thr Leu Ser Leu Pro Ala 
275 280 285 

Pro Ala He Arg Lys Glu He Leu Ser Pro Val Asp He He Asp Arg 
290 295 300 

Asn Asn His His Asn Met Met 
305 310 

(2) INFORMATION FOR SEQ ID NO:13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 276 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

Met Gin Arg Arg Trp Val Phe Val Leu Leu Asp Val Leu Cys Leu Leu 
15 10 15 

Val Ala Ser Leu Pro Phe Ala He Leu Thr Leu Val Asn Ala Pro Tyr 

20 25 30 
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Lys Arg Gly Phe Tyr Cys Gly Aso Asp Ser lie Arg Tyr Pro Tyr Arg 
35 40 45 

Pro Asp Thr He Thr His Gly Leu Met Ala Gly Val Thr lie Thr Ala 
50 55 ' 60 

Thr Val He Leu Val Ser Ala Gly Glu Ala Tyr Leu Val Tyr Thr Asp 
65 70 75 80 

Arg Leu Tyr Ser Arg Ser Asp Phe Asn Asn Tyr Val Ala Ala Val Tyr 
85 90 95 

Lys Val Leu Gly Thr Phe Leu Phe Gly Ala Ala Val Ser Gin Ser Leu 
100 105 110 

Thr Asp Leu Ala Lys Tyr Met He Gly Arg Leu Lys Pro Asn Phe Leu 
115 120 " 125 

Ala Val Cys Asp Pro Asp Tro Ser Arg Val Asn Cys Ser Val Tyr Val 
130 135 140 

Gin Leu Glu Lys Val Cys Arg Gly Asn Pro Ala Asp Val Thr Glu Ala 
145 150 155 *" 160 

Arg Leu Ser Phe Tyr Ser Gly His Ser Ser Phe Gly Met Tyr Cys Met 
165 170 ' 175 

Val Phe Leu Ala Leu Tyr Val Gin Ala Arg Leu Cys Trp Lys Trp Ala 
180 185 190 

Arg Leu Leu Arg Pro Thr Val Gin Phe Phe Leu Val Ala Phe Ala Leu 
195 200 205 

Tyr Val Gly Tyr Thr Arg Val Ser Asp Tyr Lys His His Trp Ser Asp 
210 215 220 

Val Leu Val Gly Leu Leu Gin Gly Ala Leu Val Ala Ala Leu Thr Val 
225 230 235 240 

Cys Tyr He Ser Asp Phe Phe Lvs Ala Arg Pro Pro Gin His Cys Leu 
245 250 255 

Lys Glu Glu Glu Leu Glu Arc Lvs Pro Ser Leu Ser Leu Thr Leu Thr 
260 265 270 

Leu Gly Arg Gly 
275 

(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 14 : 
GGCT CTAG AT ATTAATAGTA ATCAATTAC 
(2) INFORMATION FOR SEQ ID NO: 15: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 
CCTCACGCAT GCACCATGGT AATAGC 26 
(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 
GGTGCATGCG TGAGGCTCCG GTGC 24 
(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY; linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 
GTAGTTTTCA CGGTACCTGA AATGGAAG 2 8 

(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:18: 
GGCATGGTAC CATGTTTGAC AAGACGCGGC 30 
(2) INFORMATION FOR SEQ ID NO : 19 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
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(D) TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 
CATATGTAGT ATTCAATGTA ACC 
(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 47 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 
TGATGGCTAG CATGCAGAGA AGATGGGTCT TCGTGCTGCT CGACGTG 
(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 4 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 
AGTGCGGGAT CCCATAAGTG GTTG 
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What Is Claimed Is: 

1 . An isolated polynucleotide encoding human 
phosphatidic acid phosphatase wherein said polynucleotide 
encodes a protein comprising a polypeptide sequence 
5 selected from the group consisting of (i) the sequence at 

amino acid number 1 to amino acid number 2 84 in Figure 1 
(SEQ ID NO:2) , (ii) the sequence at amino acid number 1 
to amino acid number 285 in Figure 2 (SEQ ID N0:4) , and 
(iii) the sequence at amino acid number 1 to amino acid 
10 number 276 in Figure 4 (SEQ ID NO: 8) . 

2 . An isolated human phosphatidic acid phosphatase 
protein, wherein said protein comprises a polypeptide 
sequence selected from the group consisting of (i) the 
15 sequence at amino acid number 1 to amino acid number 2 84 

in Figure 1 (SEQ ID NO:2), (ii) the sequence at amino 
acid number 1 to amino acid number 2 85 in Figure 2 (SEQ 
ID NO: 4) , and (iii) the sequence at amino acid number 1 
to amino acid number 276 in Figure 4 (SEQ ID NO: 8) . 

20 

3 . A method of preparing a human phosphatidic acid 
phosphatase -j3 protein comprising the steps of (i) 
transforming a host cell with an expression vector 
comprising a polynucleotide encoding human phosphatidic 
25 acid phosphatase, (ii) culturing said transformed host 

cells which express said protein and (iii) isolating said 
protein. 

4 . The method of claim 3 , wherein said 
3 0 polynucleotide encoding human phosphatidic acid is 

selected from the group consisting of (i) the sequence at 
amino acid number 1 to amino acid number 284 in Figure 1 
(SEQ ID NO:2) , (ii) the sequence at amino acid number 1 
to amino acid number 2 85 in Figure 2 (SEQ ID NO:4), (iii) 
35 the sequence at amino acid number 1 to amino acid number 

311 in Figure 3 (SEQ ID NO: 6) , and (iv) the sequence at 
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amino acid number 1 to amino acid number 276 in Figure 4 
(SEQ ID NO: 8) . 

5. A method of dephosphorylating a substrate 
comprising recombinantly producing a human phosphatidic 
acid phosphatase protein and contacting said substrate 
with an effective amount of said recombinantly produced 
human phosphatidic acid phosphatase protein such that 
said protein catalyzes the dephosphorylation of said 
substrate . 

6. The method of claim 5, wherein said protein 
comprises the polypeptide sequence at amino acid number 
1 to amino acid number 2 84 in Figure 1 (SEQ ID NO: 2) . 

7. The method of claim 5, wherein said protein 
comprises the polypeptide sequence at amino acid number 
1 to amino acid number 285 in Figure 2 (SEQ ID NO: 4) . 

8. The method of claim 5, wherein said protein 
comprises the polypeptide sequence at amino acid number 
1 to amino acid number 311 in Figure 3 (SEQ ID NO: 6) . 

9. The method of claim 5, wherein said protein 
comprises the polypeptide sequence at amino acid number 
1 to amino acid number 276 in Figure 4 (SEQ ID NO: 8) . 

10. The method of claim 5, wherein said substrate 
is selected from the group consisting of phosphatidic 
acid, lysophosphatidic acid, ceramide 1-phosphate, and 
sphingosine 1 -phosphate . 

11. The method of claim 5, wherein said contacting 
is effected in vitro, and further comprises the step of 
isolating said dephosphoryled substrate. 
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12. The method of claim 5, wherein said contacting 
step occurs in vivo and is effected by the administration 
of said human phosphatidic acid phosphatase to a mammal 
in need thereof . 

13. A method of dephosphorylating a substrate 
comprising contacting said substrate with an effective 
amount of isolated human phosphatidic acid phosphatase 
protein such that said protein catalyzes the. 
dephosphorylation of said substrate, wherein said 
substrate is selected from the group consisting of 
lysophosphatidic acid, ceramide 1-phosphate, and 
sphingosine 1-phosphate. 
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Fig. 1A 



75 



100 



G1 y Ala Ala 
105 110 



130 



AAT GCA GAA AGA GTT AAG GAA GGC AGG TTG TCC TTC TAT rCR rrr 
Asn Ala Glu Arg Val Lys Glu Gly Arg Leu Ser lie lyl Ser Gly 

CAC TCT TCG TTT TCC ATG TAC TGC ATG CTG TTT GTG GCA CTT TAT 896 
His Ser Ser Phe Ser Met Tyr Cys Met Leu Phe Val Ala Leu lyl 
175 180 ion 

CTT CAA GCC AGG ATG AAG GGA GAC TGG GCA AGA CTC TTA CGC err Q41 
Leu Gin Ala Arg Met Lys Gly Asp Trp Ala Arg Leu Leu Arg Pro 

195 ^nn 
ACA CTG CAA TTT GGT CTT GTT GCC GTA TCC ATT TAT GTG GGC CTT 98 6 

Thr Leu Gin Phe Gly Leu Val Ala Val Ser lie Tyr Val £S Leu 

205 210 Ol R 

TCT CGA GTT TCT GAT TAT AAA CAC CAC TGG AGC GAT GTG TTG ACT 1031 

Ser Arg Val Ser Asp Tyr Lys His His Trp Ser Asp Val Leu Thr 

220 225 o-»n 

GGA CTC ATT CAG GGA GCT CTG GTT GCA ATA TTA GTT GCT GTA TAT 107 6 

Gly Leu He Gin Gly Ala Leu Val Ala He Leu Val Ala Val Tyr 

235 240 ->a^ 

GTA TCG GAT TTC TTC AAA GAA AGA ACT TCT TTT AAA GAA AGA AAA 1121 

Val Ser Asp Phe Phe Lys Glu Arg Thr Ser Phe l£ g?S ArJ JJJ ' 

255 260 
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122 
182 
242 



Met Phe Asp Lys Thr 

CGG CTG CCG TAC GTG GCC CTC GAT GTG CTP rrr rvn ~~ 5 

Ar 3 Leu Pro Tyr Val Ala Leu Asp "J J* Ifs vS Leu Leu SI ^ 

GGA TTG CCT TTT GCA ATT CTT ACT TCA AGG r»T srr 20 
Gly Leu Pro Phe Ala lie Leu Thr S S g SI }£ ~ £ ~ 

30 

CGA GGA GTA TTC TGT AAT GAT GAG TCC ATP anr -r*n ^ 35 

AT, Gly Val Phe Cys Asn Asp £ J£ £ "I lyr PrI ryr J£ 4 " 

Su Asp s s si s a s » s s iii Pro »« 

E E S £ 511 SI £ £ ~ | S « E « «£ », 



TGT AAC CTT TTG CAC TCA AAT TCC TTT ATP nrr a*™ ™„ * 80 
cys As„ Leu Leu His Ser Asn Ser III S S g Asl j£ ryr lie 

GCC ACT ATT TAC AAA GCC ATT GGA ACC TTT TTA TTT rrr rra nn» 
Ala Thr lie Tyr Lys Ala He Gly Thr Phe llu Phe Glv S £1 671 



GCT AGT CAG TCC CTG ACT GAC ATT GCC AAG TAT Tra w .i 

Ala Ser Gin Ser Leu Thr Asp He S J£ ™ He Gly £g 7 " 

CTG CGG CCT CAC TTC TTG GAT GTT TGT GM CCA GAT TGG TCA AAA 761 
Leu Arg Pro Hxs Phe Leu Asp Val Cys Asp Pro Asp Trp Ser JJi 



ATC AAC TGC AGC GAT GGT TAC ATT GAA TAC TAC ATA TGT CCA rrr 
lie Asn Cys Ser Asp Gly Tyr He Glu Tyr He* Cys Arg Gly* ^ 

14 5 150 
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Fig. 1B 

GAG GAG GAC TCT CAT ACA ACT CTG CAT GAA ACA CCA ACA ACT GGG 1166 
Su Glu Asp Ser His Thr Thr Leu His Glu Thr Pro Thr Thr Gly 

270 £.10 



265 



AAT CAC TAT CCG AGC AAT CAC CAG CCT TGA AAG GCAGCAGGGTGCCCAG 1215 
Asn His Tyr Pro Ser Asn His Gin Pro 
280 



GTGAAGCTGGCCTGTTTTCTAAAGGAAAATGATTGCCACAAGGCAAGAGGATGCATCTTT 

St^tggtgtacaagcctttaaagacttctgctgctgatatgcctct 
^tgc™gta^ 

ataatacatattaaaatgtatgggagaaccaaaaaaaaaaaaaaaaaa 



1275 
1335 
1395 
1455 
1515 
1563 
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Fig. 2A 



CCTGTGGGAGAGAGCGCCGGGATCCGGACGGGGTAGCAACCGGGGCAGGCCGTGCCGGCTGA 62 
GGAGGTCCTGAGGCT ACAGAGCTGCCGCGGCTGGCACACGAGCGCCTCGGC ACT AACCGA 122 
GTGTTCGCGGGGGCTGTGAGGGGAGGGCCCCGGGCGCCATTGCTGGCGGTGGGAGCGCCG 182 
CCCGGTCTCAGCCCGCCCTCGGCTGCTCTCCTCCTCCGGCTGGGAGGGGCCGTATCTCGG 24 2 
GGCCGTCGCCAGCCCCGGCCCGGGCTCGATAATCAAGGGCCTCGGCCGTCGTCCCGCACC 302 
TCATTCCATCGCCCTTGCCGGGCAGCCCGGGCAGAGACC ATG TTT GAC AAG ACG 356 

Met Phe Asp Lys Thr 

5 

CGG CTG CCG TAG GTG GCC CTC GAT GTG CTC TGC GTG TTG CTG GCT 4 01 

Arg Leu Pro Tyr Val Ala Leu Asp Val Leu Cys Val Leu Leu Ala 
10 15 20 

TCC ATG CCT ATG GCT GTT CTA AAA TTG GGC CAA ATA TAT CCA TTT 44 6 

Ser Met Pro Met Ala Val Leu Lys Leu Gly Gin lie Tyr Pro Phe 
25 30 35 

CAG AGA GGC TTT TTC TGT AAA GAC AAC AGC ATC AAC TAT CCG TAC 4 91 

Gin Arg Gly Phe Phe Cys Lys Asp Asn Ser lie Asn Tyr Pro Tyr 
40 45 50 

CAT GAC AGT ACC GCC GCA TCC ACT GTC CTC ATC CTA GTG GGG GTT 536 
His Asp Ser Thr Ala Ala Ser Thr Val Leu lie Leu Val Gly Val 
55 60 65 

GGC TTG CCC GTT TCC TCT ATT ATT CTT GGA GAA ACC CTG TCT GTT" 581 
Gly Leu Pro Val Ser Ser lie lie Leu Gly Glu Thr Leu Ser Val 
70 75 80 

TAC TGT AAC CTT TTG CAC TCA AAT TCC TTT ATC AGT AAT AAC TAC 626 
Tyr Cys Asn- Leu Leu His Ser Asn Ser Phe lie Ser Asn Asn Tyr 
85 90 .95 

ATA GCC ACT ATT TAC AAA GCC ATT GGA ACC TTT TTA TTT GGT GCA 671 
He Ala Thr He Tyr Lys Ala He Gly Thr Phe Leu Phe Gly Ala 
100 105 HO 

GCT GCT AGT CAG TCC CTG ACT GAC ATT GCC AAG TAT TCA ATA GGC 716 
Ala Ala Ser Gin Ser Leu Thr Asp He Ala Lys Tyr Ser He Gly 
115 120 125 

AGA CTG CGG CCT CAC TTC TTG GAT GTT TGT GAT CCA GAT TGG TCA 7 61 

Arg Leu Arg Pro His Phe Leu Asp Val Cys Asp Pro Asp Trp Ser 
130 135 140 

AAA ATC AAC TGC AGC GAT GGT TAC ATT GAA TAC TAC ATA TGT CGA 80 6 

Lys He Asn Cys Ser Asp Gly Tyr He Glu Tyr Tyr He Cys Arg 
145 150 155 

GGG AAT GCA GAA AGA GTT AAG GAA GGC AGG TTG TCC TTC TAT TCA 851 
Gly Asn Ala Glu Arg Val Lys Glu Gly Arg Leu Ser Phe Tyr Ser 
160 165 170 

GGC CAC TCT TCG TTT TCC ATG TAC TGC ATG CTG TTT GTG GCA CTT 896 
Gly His Ser Ser Phe Ser Met Tyr Cys Met Leu Phe Val Ala Leu 
175 " 180 185 

TAT CTT CAA GCC AGG ATG AAG GGA GAC TGG GCA AGA CTC TTA CGC 941 
Tyr Leu Gin Ala Arg Met Lys Gly Asp Trp Ala Arg Leu Leu Arg 
190 " 195 200 

CCC ACA CTG CAA TTT GGT CTT GTT GCC GTA TCC ATT TAT GTG GGC 98 6 

Pro Thr Leu Gin Phe Gly Leu Val Ala Val Ser He Tyr Val Gly 
205 210 215 

CTT TCT CGA GTT TCT GAT TAT AAA CAC CAC TGG AGC GAT GTG TTG 1031 
Leu Ser Arg Val Ser Asp Tyr Lys His His Trp Ser Asp Val Leu 
220 225 230 

ACT GGA CTC ATT CAG GGA GCT CTG GTT GCA ATA TTA GTT GCT GTA - 107 6 
Thr Gly Leu He Gin Gly Ala Leu Val Ala He Leu Val Ala Val 
235 240 245 
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Fig. 2B 

TAT GTA TCG GAT TTC TTC AAA GAA AGA ACT TCT TTT AAA GAA AGA 1121 
Tyr Val Ser Asp Phe Phe Lys Glu Arg Thr Ser Phe Lys Glu Arg 
250 260 
AAA GAG GAG GAC TCT CAT ACA ACT CTG CAT GAA ACA CCA ACA ACT 1166 
Lvs Glu Glu Asp Ser His Thr Thr Leu His Glu Thr Pro Thr Thr 
265 270 275 

GGG AAT CAC TAT CCG AGC AAT CAC CAG CCT TGA AAGGCAGCAGGGTGCC 1215 
Gly Asn His Tyr Pro Ser Asn His Gin Pro *** 
280 285 



CAGGTGAAGCTGGCCTGTTTTCTAAAGGAAAATGATTGCCACAAGGCAAGAGGATGCATC 
TTTCTTCCTGGTGTACAAGCCTTTAAAGACTTCTGCTGCTGATATGCCTCTTGGATGCAC 
ACTTTGTGTGTACATAGTTACCTTTAACTCAGTGGTTATCTAATAGCTCTAAACTCATTA 
AAAAAACTCCAAGCCTTCCACCAAAACAGTGCCCCACCTGTATACATTTTTATTAAAAAA 
ATGTAATGCTTATGTATAAACATGTATGTAATATGCTTTCTATGAATGATGTTTGATTTA 
AATATAATACATATTAAAATGTATGGGAGAACCAAAAAAAAAAAAAAAAAA 



1275 
1335 
1395 
1455 
1515 
1566 
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Fig. 3 A 



GGCGCAGCTCTGCAAAAGTTTCTGCTCGGGATCTGGCTCTCTTCCCCTTGGACTTTAGAACG 62 
ATTTAGGGTTGACAGAGGAAAGCAGAGGCGCGCAGGAGGAGCAGAAAACACCACCTTCTG 122 
CAGTTGGAGGCAGGCAGCCCCGGCTGCACTCTAGCCGCCGCGCCCGGAGCCGGGGCCGAC 182 
CCGCCACTATCCGCAGCAGCCTCGGCCAGGAGGCGACCCGGGCGCCTGGGTGTGTGGCTG 242 
CTGTTGCGGGACGTCTTCGCGGGGCGGGAGGCTCGCGCCGCAGCCAGCGCC ATG CAA 299 

Met Gin 

AAC TAC AAG TAC GAC AAA GCG ATC GTC CCG GAG AGC AAG AAC GGC 34 4 

Asn Tyr Lys Tyr Asp Lys Ala lie Val Pro Glu Ser Lys Asn Gly 

5 10 15 

GGC AGC CCG GCG CTC AAC AAC AAC CCG AGG AGG.AGC GGC AGC AAG 389 
Gly Ser Pro Ala Leu Asn Asn Asn Pro Arg Arg Ser Gly Ser Lys 

20 25 30 

CGG GTG CTG CTC ATC TGC CTC GAC CTC TTC TGC CTC TTC ATG GCG 4 34 

Arg Val Leu Leu lie Cys Leu Asp Leu Phe Cys Leu Phe Met Ala 

35 40 45 

GGC CTC CCC TTC CTC ATC ATC GAG ACA AGC ACC ATC AAG CCT TAC 479 
Gly Leu Pro Phe Leu lie lie Glu Thr Ser Thr lie Lys Pro Tyr 

50 55 60 

CAC CGA GGG TTT TAC TGC AAT GAT GAG AGC ATC AAG TAC CCA CTG 52 4 

His Arg Gly Phe Tyr Cys Asn Asp Glu Ser lie Lys Tyr Pro Leu 

65 70 75 

AAA ACT GGT GAG ACA ATA AAT GAC GCT GTG CTC TGT GCC GTG GGG 569 
Lys Thr Gly Glu Thr lie Asn Asp Ala Val Leu Cys Ala Val Gly 

80 85 90 

ATC GTC ATT GCC ATC CTC GCG ATC ATC ACG GGG GAA TTC TAC CGG 614 
lie Val lie Ala lie Leu Ala lie lie Thr Gly Glu Phe Tyr Arg 

95 100 105 

ATC TAT TAC CTG AAG AAG TCG CGG TCG ACG ATT CAG AAC CCC TAC 659 
lie Tyr Tyr Leu Lys Lys Ser Arg Ser Thr lie Gin Asn Pro Tyr 

110 115 120 

GTG GCA GCA CTC TAT AAG CAA GTG GGC TGC TTC CTC TTT GGC TGT 7 04 

Val Ala Ala Leu Tyr Lys Gin Val Gly Cys Phe Leu Phe Gly Cys 

125 130 135 

GCC ATC AGC CAG TCT TTC ACA GAC ATT GCC AAA GTG TCC ATA GGG 7 49 

Ala lie Ser Gin Ser Phe Thr Asp lie Ala Lys Val Ser lie Gly 

140 135 150 

CGC CTG CGT CCT CAC TTC TTG AGT GTC TGC AAC CCT GAT TTC AGC 7 94 

Arg' Leu Arg Pro His Phe Leu Ser Val Cys Asn Pro Asp Phe Ser 

155 160 165 

CAG ATC AAC TGC TCT GAA GGC TAC ATT CAG AAC TAC AGA TGC AGA 8 39 

Gin lie Asn Cys Ser Glu Gly Tyr lie Gin Asn Tyr Arg Cys Arg 

170 180 
GGT GAT GAC AGC AAA GTC CAG GAA GCC AGG AAG TCC TTC TTC TCT 8 84 

Gly Asp Asp Ser Lys Val Gin Glu Ala Arg Lys Ser Phe Phe Ser 
185 190 195 

. GGC CAT GCC TCC TTC TCC ATG TAC ACT ATG CTG* TAT TTG GTG CTA 929 
Gly His Ala Ser Phe Ser Met Tyr Thr Met Leu Tyr Leu Val Leu 

200 205 210 

TAC CTG CAG GCC CGC TTC ACT TGG CGA GGA GCC CGC CTG CTC CGG 97 4 

Tyr Leu Gin Ala Arg Phe Thr Trp Arg Gly Ala Arg Leu Leu Arg 

215 220 225 

CCC CTC CTG CAG TTC ACC TTG ATC ATG ATG GCC TTC TAC ACG GGA 1019 
Pro Leu Leu Gin Phe Thr Leu lie Met Met Ala Phe Tyr Thr Gly 

230 235 240 

CTG TCT CGC GTA TCA GAC CAC AAG CAC CAT CCC AGT GAT GTT CTG 1064 
Leu Ser Arg Val Ser Asp His Lys His His Pro Ser Asp Val Leu 
245 250 255 
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Fig. 4 A 

ACC ATG CAG CGG AGG TGG GTC TTC GTG CTG CTC GAC GTG CTG TGC 47 
Met Gin Arg Arg Trp Val Phe Val Leu Leu Asp Va'l Leu Cys 

5 10 
TTA CTG GTC GCC TCC CTG CCC TTC GCT ATC CTG ACG CTG GTG AAC 92 
Leu Leu Val Ala Ser Leu Pro Phe Ala lie Leu Thr Leu Val Asn 

15 20 25 

GCC CCG TAC AAG CGA GGA TTT TAC TGC GGG GAT GAC TCC ATC CGG 137 
Ala Pro Tyr Lys Arg Gly Phe Tyr Cys Gly Asp Asp Ser lie Arg 

30 " 35 ,40 

TAC CCC TAC CGT CCA GAT ACC ATC ACC CAC GGG CTC ATG GCT GGG 182 
Tyr Pro Tyr Arg Pro Asp Thr lie Thr His Gly Leu Met Ala Gly 

45 " 50 55 

GTC ACC ATC ACG GCC ACC GTC ATC CTT GTC TCG GCC GGG GAA GCC 227 
Val Thr lie Thr Ala Thr Val lie Leu Val Ser Ala Gly Glu Ala 

60 65 70 

TAC CTG GTG TAC ACA GAC CGG CTC TAT TCT CGC TCG GAC TTC AAC 272 
Tyr Leu Val Tyr Thr Asp Arg Leu Tyr Ser Arg Ser Asp Phe Asn 

75 80 85 

AAC TAC GTG GCT GCT GTA TAC AAG GTG CTG GGG ACC TTC CTG TTT 317 
Asn Tyr Val Ala Ala Val Tyr Lys Val Leu Gly Thr Phe Leu Phe 

90 95 100 

GGG GCT GCC GTG AGC CAG TCT CTG ACA GAC CTG GCC AAG TAC ATG 362 
Gly Ala Ala Val Ser Gin Ser Leu Thr Asp Leu Ala Lys Tyr Met 
105 HO 115 

ATT GGG CGT CTG AAG CCC AAC TTC CTA GCC GTC TGC GAC CCC GAC 4 07 

lie Gly Arg Leu Lys Pro Asn Phe Leu Ala Val Cys Asp . Pro Asp 
120 125 130 

TGG AGC CGG GTC AAC TGC TCG GTC TAT GTG CAG CTG GAG AAG GTG 452 
Trp Ser Arg Val Asn Cys Ser Val Tyr Val Gin Leu Glu Lys Val 
135 140 145 

TGC AGG GGA AAC CCT GCT GAT GTC ACC GAG GCC AGG TTG TCT TTC 4 97 

Cys Arg Gly Asn Pro Ala Asp Val Thr Glu Ala Arg Leu Ser Phe 
150 155 " 160 

TAC TCG GGA CAC TCT TCC TTT GGG ATG TAC TGC ATG GTG TTC TTG 54 2 

Tyr Ser Gly His Ser Ser Phe Gly Met Tyr Cys Met Val Phe Leu 
165 170 175 

GCG CTG TAT GTG CAG GCA CGA CTC TGT TGG AAG TGG GCA CGG CTG 587 
Ala Leu Tyr Val Gin Ala Arg Leu Cys Trp Lys Trp Ala Arg Leu 
180 185 190 

CTG CGA CCC ACA GTC CAG TTC TTC CTG GTG GCC TTT GCC CTC TAC 632 
Leu Arg Pro Thr Val Gin Phe Phe Leu Val Ala Phe Ala Leu Tyr 
195 200 205 

GTG GGC TAC ACC CGC GTG TCT GAT TAC AAA CAC CAC TGG AGC GAT 677 
Val Gly Tyr Thr Arg Val Ser Asp Tyr Lys His His Trp Ser Asp 
210 ' 215 220 

GTC CTT GTT GGC CTC CTG CAG GGG GCA CTG GTG GCT GCC CTC ACT 722 
Val Leu Val Gly Leu Leu Gin Gly Ala Leu Val Ala Ala Leu Thr 
225 " 230 235 

GTC TGC TAC ATC TCA GAC TTC TTC AAA GCC CGA CCC CCA CAG CAC 7 67 

. Val Cys Tyr lie Ser Asp Phe Phe Lys Ala Arg Pro Pro Gin His 
240 245 250 

TGT CTG AAG GAG GAG GAG CTG GAA CGG AAG CCC AGC CTG TCA CTG 812 
Cys Leu Lys Glu Glu Glu Leu Glu Arg Lys Pro Ser Leu Ser Leu 
255 260 265 

ACG TTG ACC CTG GGG CGA GGC TGA CCACAACCACTTATGGGATACCCGCACT 8 64 

Thr Leu Thr Leu Gly Arg Gly *** 
270 275 
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Fig. 4B 

CTTCTTCCTGAGGCCGGACCCCGCCCAGGCAGGGAGCTGCTGTGAGTCCAGCTGATGCCC 
arrCAGGTGGTCCCTCCAGCCTGGTTAGGCACTGAGGGTTCTGGACGGGCTCCAGGAACC 
CTGGGCTGATGGGAGCAGTGAGCGGTTCCGCTGCCCCCTGCCCTGCACTGGACCAGGAGT 
CTGGAGATGCCTGGGTAGCCCTCAGCATTTGGAGGGGAACCTGTTCCCGTCGGTCCCCAA 
ATATCCCCTTCTTTTTATGGGGTTAAGGAAGGGACCGAGAGATCAGATAGTTGCTGTTTT 1164 
GTAAAATGTAATGTATATGTGGTTTTTAGTAAAATAGGGCACCTGTTTCACAAAAAAAAA 1224 

AAAAAAAAAA 



924 
984 
1044 
1104 
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Fig. 6 
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Fig. 7 



<L> 

-£ON ON 



V-( V-i ^ 

o op 

+ + 4- 

mm m 
On On On 
(NCN r<i 

CQPQPQ 
WWW 



Ph 

+ 

ON 

W 



PhPm 

+ + 

ON ON 

PQCQ 
WW 



DAGr 




03 

m 



m 

o 
o 

5 



1 23 4 567 89 



11/13 



SUBSTITUTE SHEET (RULE 26) 



WO 98/46730 



PCT/US98/07928 



Fig. 8 
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Fig. 9 
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